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ABSTRACT 

The project was designed to investigate sources of 
available data on the incidence, prevalence, diagnostic conditions, 
functional limitations, and activity restrictions of the disabled 
population in the United States, and to design a comprehensive data 
system. Difficulties, both conceptual and practical in estimating 
incidence and prevalence are cited, including divergent data sources, 
varying definitions of disability and handicap, and inconsistent 
methodologies. Guidelines for extrapolating from existing data are 
given, including techniques of aggregation and merging, and linking 
files by entering distributions and summary figures into 
multidimensional tables. The report recommends steps to be taken by 
the, N^tional^ Institute of Handicapped Research to improve disability 
statistics, including advocacy for a uniform functional definition of 
disability and handicap and for inclusion of disability data in 
censiis and other national surveys. The development of a Disability 
Data Archiving and Analysis Center is detailed, and such a project's 
activities in nine categories are described: acquisition, cataloging, 
abstracting, reference, circulation, file processing, analysis, 
dissemination, and data improvement. Functional components as well as 
procedures, examples of user requests, and administration 
considerations are noted. Among five appendixes are papers on machine 
readable data files and a selected bibliography on the definition of 
measurement and disability. (CL) 
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EXECUTIVE SUMMARY 

In November, 1980r the National Institute of 
Handicapped Research (NIHR) awarded the Bureau of Social 
Science Research (BSSR) a one-^year grant to investigate the 
feasibility of establishing a comprehensive data system on 
the handicapped population of the United States. In the 
course of this work, BSSR staff members investigated the 
scope and. content of over 300 machine-readable data files. 
We were' able to provide NIHR ' ith detailed descriptions of 
42 major files and with summaiies of 45 others. The initial 
purpose of our investigation was to determine which files 
could be used as the basis for a demographic information 
system. We concluded that in the absence of the Post-Census 
Disability Survey, no existing file could serve as the basis 
for an information system which would provide immediate 
answers to queries from a terminal. We further concluded 
that it is nou possible to construct an omnibus "demographic 
master file" on the disabled population from existing files. 
Wc concluded that the goals implicit in the establishment of 
a comprehensive data system could best be pursued in^four 
ways: 

Data improvement . NIHR should develop a convincing 
argument to Congress and the Administration for 
implementation of a uniform functional definition of 
disability, and the inclusion of disability data in census 
and other national surveys. The basis of better information 
is better statistical data. At present , data on the 
disabled population are fragmented, incomplete, and 
incompatible. We recommend a set of statistical, data 
archiving, and technical policies to NIHR. ^ The NIHR 
statistical policy should reconcile NIHR's statistical needs 
with the availability of data, thereby generating a set of 
data collection needs and priorities. The data archiving 
and access policy establishes machine-readable data files as 
bibliographic entities, much like publications and other 
information media. The policy is the basis for NIHR's 
participation ^n the federal "information economy" as a 
colleague of established statistical agencies. 

A core multipurpose analysis agency . NIHR should 
become a core multipurpose analysis agency concerned with 
disability statistics. 

DIDAAC. NIHR should establish a Disability Data 
Archive and Analysis Center (DIDAAC), a facility for 
answering statistical questions concerning the disabled 
population. DIDAAC would provide the best available answers 
through a comprehensive collection of files, advanced 
statistical and computing techniques, and serious 



3 



scholarship by its* staff- We providers functional and 
organizational analysis of DIDAAC, sonie project cost and 
staffing levels, and a set of alternatives for its 
establishment in ^ N?HR, in another federal agency, or as a 
contract with a non-government agency. 

Demonstration project s NIHR should support ^ a 
demonstration project to test. DIDAAC * s capability in making 
projections of national estimates from existing data 
sources. 
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r. INTRODUCTIOI^- . 

There is presently an urgent and^^nmet need for 
accurate statistical information . on _the handicapped 
population of the nation. Currently, there may' be^as many 
as ten million. Americans so severely disabled that . -they need 
constant assistance. -Another 20 to 30 million Americans may 
suffer from some form of disabling or handicapping 
condition.. Such numbers rise an%>fali as Americans enter 
and leave the disability stream. Increased medical 
knowledge and technological innovations,^ coupled with bettet* 
serviced and legislation responsive to the needs of the 
ha h'di chapped, have made it possible for many people with 
disabilities to live productive -lives. ' , . 

Disability is here defined as a reduction in a major 
life activity resulting from an acute or chronic condition;)^ 
The severity of disability is defined by the degree of- 
functional limitation a condition imposes on a person, and 
the extent to which it affects a major li.fe -acjtivity. 
Clearly, the definition must begin with the pathology which 
causes an impairment. However, clinical conditions acquire 
significance only if they lead to a functional limitation of 
a major life activity such as employment, schooling, or 
housework. The intervening variables linking impairment arid 
disability are the unique characteristics of the person 
and/or._ the' physical, psychoiogi'cal , and socioeconomic 
environment in which the person, acts. Disability in this 
sense is a relative and dynamic phenomenon. What is a 
disabling condition for one person may notj be so for 
another. F^^urther, changing., circumstances, attitudes and 
aspirations, as well as rehabilitation, may reduce or alter, 
the impact of a corfdition that was previously disabling. 

-1" . ^ ^ 



* For a discussion of definitional and measurement problems, 
see John . Terry^ Chase, Engin Inel Holmstrom, ,and Albert 
H. CantriJ., Research Memorandum No. 1 ; An Approach to the 
Review - ot Data on the Demographic and - Epidemiological 
CharacterTstics of -^ the Handicapped for Purposes of 
Developing a Comprehensive Data" System . Washington, D.C., 
Bureau, of 'Social .Science Research, 1981. 
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For the purpose of this paper, we do not differentiate 
between disability and handicap* The current use of the 
term "handicap" is ts-i^xJ to eligibility criteria, reflecting 
legal definitions written into legislation* However, there 
is in this country a growing 'interest in the adoption of 
terminology developed by Philip H* N* Wood iEor the ^^rld 
Healt-h Organization. Wood' defined "disability" as "any 

, restriction or , lack ( resulting from impairment) of abilsTty 
to perform^ an activity in the manner or within the. ran^ 

^considered normal for a human being." He defined "handicap" 
as "a disadvantage for a given individual,, resulting from an 
impairment, or a disa^bility, that limits or prevents the 
fulfillment of a role that is normal (depending engage, sex, 
and social and cultural factors) for that individual*"* 
Handicap, in this sense,' becomes a social phenomenon, *based- 
on 'the interaction of tli^ disabled person ' with his* 
environment. Wood suggests tha,t the term "disaolemefft" bf; 
uBed as a collective descriptors referring to any expifience^ 
identified *by the terms "impairment," "disability," and" 
"handicap." Woods' suggestion "is currently' under,' review, 
and the problem of nomenclature and classification is yet to 
be resolved. " { 

^ \ 
During 'the 1970' s, the public became uncrVasingly 
concerned wxth the plight of .handicapped people. Opngcess- 
passed^ .many legislative acts, ^resulting^ in many nev and 
enlarged> programs/f (^"'handicapped people. By 1978, oMer , 80* 
different Feder'al^programs provided services to handicapped 
people either ^directly or^ indirectly.^ Many of [ these 

, programs coi-i^l. informatibn, but ^ each "has' a diTI^ent 

'legislative' mandate* and has developed different definitions 
of their service populatio'ns . ^No single Federal agency or 
mechanism, such as .th^> Census," copects uni'forjn and 
consistent information oil the . handic'apped people of the 
nation. As a result, only estimates are available on the 
number of people with different disabilities, and. relatively 
little is known 'about their, demographic and clinical 
characteristics, thiiir attitudes and aspirations, needs, and 
service experiences. National data on persons^^ with 
disabilities are needed to make informe.d policy decisions, 
and to guide research to' identify and . serve, better the 
handicapped *populati on of the^^nation. ' . 

Most policy ;decisions are allocation* decisions. They 
implicitly or explicitly state a^ set of goals,* their 
relative importance, and the rei^ative .resources to be 
expended in pursuit of each goal. Many policy decisions on 

— y — — . • , .• , 

' World Health' Organization. International C lassification 
of Impairments, Disabilities, and Handicaps . Geneva.: World 
'Health Organ izatwwi , 198.0:27-28. ' . 



^ Unpublished interagency communication. 
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handicapped ^ people concern allocations of resources. The 
prudence and equity of such decisions depend heavily on an 
understandi ng..of._thf ^demo.graphy^pi. handicap ped perso ns. 

One . set of d^a essential to <the equitable allocation 
of resources among the disabled pertains tto the incidence 
and prevalence pf various handicaps. How many people suffer 
from a particular condition at present? How many will incur 
a~cTo'nl3i~tliTo~nT:oyj5T^ of a year? Which conditions are 

"disabling" for persons with what Kinds of 'demographic 
characteristics? Accurate estimates of carefully N^etermined 
diagnostic conditions are ^needed, along with information on 
functional limitations and activity restrictions. \ What is 
the degree of , functional limitation, a condition injposes? 
What is the best method by which limitations can best be 
overcome by taking a particular action or set of actions? 
What are the demographic and background characteristics of 
people with disabilities, their geographic distribution, 
racial and ethnic background, educational and , economic 
status, family situation,- etc.? 

Statistical data on the incidence and prevalence of 
various handicaps and their dempgraphi-c . correlates 
constitute only one kind of information essential to policy 
making. A second level of more complex and relSLtively 
subjective information is requi-red to understand the 
disability phenomenon and to establish *the mix of support 
s^ervices and programs that will be most successful in 
helping people to cope with or overcome particular 
conditions. We need to know about diagnostic condition(s) 
from which the disabled person suffers , and the Junctional 
-> limitations and activfty restrictions .with which he or she 
has to cop.e. .Further, we must examine the social situation 
in-^hich he finds himself. The attitudes and resources of 
the family members and those of the person^himself play .a 
great ^role in defining the situation and determining the 
. types arid extent of his service nee.ds. 

\ <s 

" Both quantitative and qualitative information are 
required to assist in the allocation of resources and in. the 
jeronduct of research essential for the development of better 
/services. Further, such information helps to inform and 
shape public opinion by establishing a common ground of 
understanding about the size and character of the nation's 
handicapped population. At ^present there is no single 
source pf such data. The available data sources suffer from 
various definitional and measurement problems, and the 
Jn format ion that they provide i? of dubious quality, lacking 
the accuracy required' of national statistics. 

The legislation establishing the National Institute of 
Handicapped Research (NIHR)was designed^to provide the means 
for developing an adequate statistical bas.e for policy. It 
specifically gives NIHR the mandate for: 
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producing, in conjunction with the Department of 
Labor, the National Center iEor Health Statistics, 
the Bureau of the Census,, the Social Security 

-Admin i-s tra t-ion-, ^and — =0 the r^ - federal ,de'pa r.tmen.ts_a_nci 

agencies, as may. be appropriate,, statistical reports 
and studies on the employment, health, income, and 
other demographic characteristics of handicapped 
persons an^ disseminatingsych 'reports and studies 

-to— rehabi4d~ta£ion-^professlonaIs.-^andLoi.he.r_^^ assist 

in the planning and evaluation of vocational and 
other rehabilitation services for the handicapped* * 

It is against this backdrop that NIHR :supported a one- 
year • planning project' at the Bureau of Social Science 
Research (BSSR). As specified in the program announcement 
under which the 9^^^^ was awarded, "the purpose of the 
project is to desigh a comprehensive clata system which 
incorporates data from a number of sources (systems) on the 
demographic and clinical characteristics of the disabled 
population." The program announcement described the purpose 
o'f the comprehensive data system as: 



1. To identify the epidemiological patterns of disability 
■ and related problems, including background and 

identifying information, type of disability and 
functional limitation, (e.g., vocational, independent 
living), as well as specific types of physical or mental 
impairment. 

2. To identif]^ patterns of need and problem identification 
as perceivea by the. disabled population, including need 
for services and 'benefits and problems of health and 
restoration care, income, transportation, etc. 

3. To identify the residual capabilities and strengths of 
the disabled, particularly in relation to vocational and 
social potentials and as;pirat ions; 

4. To identify the experience . of disabled clients with 
service organisations, problems and needs 'concerning 
independent living, housing, legal matters, work, 
transportation, recreation ,^ etc. 

During the course of the project it became evident that 
there were very few, if any, national sources of information 
on" needs, residual capacities and strengths, and service 
experiences of disabled populations. The emphasis of the 
project was shifted to identifying data sources that could 



* Section 202(b) (8) of The Rehabilitation, Comprehensive 
Services and Developmental Disabilities Amendments of 1978 
(P.L. 95-602). 
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provide national estimates of the incidence and prevalence 
of disability, as well as other identifying information on 
the demographic characteristics of disabled Americans. 

--thrs report is the -summar-y document of. -the. pr.oj^ect.. JJb 

is in four parts: In Chapter II, we highlight the 
difficulties involved in providing reliable estimates of the 
incidence and prevalence of disabilities in the United 
States, discuss the shortcomings of available data sources, 
end^ present a brief' description of information, needs, and, 
gaps- in this field. In Chapter III, we suggest techniques 
for making national estimates from existing data. In 
Chapter IV, ; we present our recommendations to NIHR, 
including a discussion of statistical policies and 
coordination that NIHR should develop in this area. In 
Chapter V, we present a detailed description of the proposed 
data archiving and analysis center, including alternative 
institutional arrangements and other considerations relevant 
to Its implementation. 

Appended to the report are several planning documents 
in support of various aspects of the operation of eT^ central 
statistical facility. 
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II • ESTIMATING INCLDENCE^ AND PREVALENCE 

— ^~^~~'Atr present, -^e- -hay e--nO' -sin gie source-o£ adequate data 
on the. incidence arid prevalence of disabling conditions for 
the population of the United States* On the one hand, there, 
IS both underenumerat ion and pverenumeration of disabling 
conditions because of mult iple ^impairments , and because of 
variati'on^ in deflni tXonaL thresholds that govern^ inclusron- 
in or exclusion from programs. On the other hand, the 
statistics that do exist cannot be effectively aggregated 
because of inconsistencies in data sources and in methods 
employed -in collection and representation. As a result, 
only "gross national estimates are available, and only in 
selected instances can a demographic .profile of those wit,h 
part icular^impairments be obtained.* 

This section of the report identifies the conceptual 
and practical problems associated with currently available 
data sources. This section discusses obstacles to obtaining 
national estimates from such sources, and briefly outlines 
major information gaps. ' The next section- includes a 
discussion of techniques that might be used in reducing the 
uncertainty of estimates. 



Conceptual And ' Practical Problems 

How many handicapped people are there in the'' United 
States? According to the Social Security Administration's 
1972 Survey of Disabled and Nondisabled Adults, 7.7 million 
adults of working age had functional liinitations that can be 
deiEined as severely disabling. The same survey identified 
another 3.4 million with less severe occupational 
c3isabili ties., and 4.3 million with secondary 'work 
imitations,, raising the total^estimate of work disability 
among adults to just over .15 million.* Four years later, the 
Bujreau of the Census' Income and Education Survey reported 
16^.6 million people with some degree of work disability. It 
also reported 28 . 2 mill ion persons with some form of 
activity restriction.'' 



* Rehab Group, Incorporated* Digest of Data on Persons With 
Disabilities . Washington, D.C. U.S. Government Printing 
Office, 1979. 

* Donald T. Ferron. Disability Survey 72. Washington, 
D.C.: U.S. Government Printing Office, 1981, p. 9. 

' Rehab Group, Inc. Og Cit p. 16. 
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Estimates for work-disabled people have ranged from 12 
miirion at. the beginning of the 1970's* to 21 million in 
1978.' Estimates of the prevalence of disability usually 
vary from 13 percent to fl percent depending on the 
reference source used. ^ The '^Rj^hab Group, incorporated, has 
noted tha^t there are over' 41 '^million impairments in the 
UrTitecf States, but because multiple ampairments are coirimo'n^ 
,the number of persons with impairments is not known. 

Given the disparate nature of these national estimates 
o.l. _ _.t he. _ _i:ncji_de_n_ce_oJ "handicaps y/" it is n ot s urprising that 
detailed breakdowns of kinds of impairment or the 
demographic attributes of persons with one or another 
impairment pose exceedingly complex questions regarding 
sources of data, measurement techniques, and estimation 
procedures . 

The ability to make- national estimates and provide an 
overview of the demographic ^and epidemiological 
characteristics of the disabled populfation would be greatly 
facilitated by the existence of "^h /integrated data file. 
Such a file would contain records on individuals linking 
data on medical conditions; impairments; limitations of 
function; and work, school, and self-care disability to data 
on income, service experience , service costs, etc. It would 
carry identifying information on*- each/, record which would 
allow aggregation into geographic and other functional 
units, and linking to other- files v^based on those units. 
Such a -f ile cpuld be the basis , Lor a^.: "query system" in which 
those requesting information cou:ld retrieve demographic, 
cost, income, and benef it statistics in response to commands 
entered at the keyboard. 



*' Saad Z. Nagi, An Epidemiology of Disability Among Adults 
in the United States, The Milbank Memo rial F und ^_ .Qua rt e rly. ,. 
1976, Vol.54, No. 4, Ea 11-, -4-3 9-46 7-;"— 

^ U. S. Department of Heali^th and Human Services, Social 
Security Administration. Work Disability in the United 
States; A Chartbook . Washington , DC: U.S. Government 
Printing Office, 1980.' (SSA Publication No. 13-11978.) 

R. A.. Dudek and W. M. Marcy. "Public policy and social 
responsibility with regard to rehabilitation and maintenance 
of disabled persons." Technological Forecasting and Social 
Change , 1980, Vol. 17, 61-72. 

Rehab Group, Inc-» Cit , p. 2. 
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In the course of the project, we identified more than 
300 machine-readable data files, providing information on 
incidence and prevalence rates as well as needs, 
aspirations, and service experiences of. disabled 
individuals. We undertook a major effort to collect 
background information on all of these files in order to 
conducts— ja— me t-hodoiog-ica-1 — screen-i-ng* Howeverj such a task- 
proved to be too . time-consuming for a one-year planning 
grant. , We were * able to provide NIKR with detailed 
descriptions of 42 major files and with s.ummaries of 45 
others. Appendix D presents a listing of 274 -data sources 
^pgrwhicK- som^ bac^k^grlDund information was obtained. These 
data^ sources are organized according to their sampling 
design and sponsorship as: National-Federal, National non- 
Federal,„ Multi-regional, State, Local, and International. 

Our investigation of these data sources showed tkat no 
existing file could fulfill the functions of an ' integrated 
file as initi-^lly envisioned. In lieu of such a single 
file, we explored the feasibility of constructing a 
"demographic master file" from existing statistical data on 
the disabled. Our aspiration was to ascertain whether such 
a master file couic3 be constructed from other existing da^ta 
files by record merging^ ^ and concatenation.^* The file 
would identify disability in the most general functional 
sense (i. e., functional limitations restricting a major or 
related^ life activity) and would minimally contain 
iniEqrmation on sex, race/ethnicity," income, education, 
family status, and geographic residence. 

We examined 29 relatively recent national or multi- 
regional surveys conducted since 1975 that were concerned 
with disability either directly^ or indirectly. Table 1 
summarizes information provided by these surveys. Our 
review of these 29_ data files showed that they offered 
little prospect for the creatiorf at a "demographic master 
-^lile..." _jrhe_mfijpT d i f f i c u 1 1 y a£i s e s ^f^ om the limited purpose 
of various si;rveys and the resulting ilicon'sTstencies or- 



Machine-readable data files (MRDF) are set^s of 
information generally stored on punched cards or on ma^gnetic 
media such as tape or disk. 

Record merging is the construction of a record from two 
or more records containing information on the same unit of 
observation. For example, a single file could ^ be 
constructed by merging records containing medical 
information on individuals with record's containing income 
information on the same perspns. Both the two original' 
files and the merged file would have one record per person. 

^* Concatenation is the combining of two files end-to-end. 
The files must be in the same format. 
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incomparabi lity among items used in defining disability. 
There is al:so some inconsistency in the representation of 
demographic information. A related problem is that of 
estimating national population parameters, since most of the 
surveys are not based on samples of the U. S. population, 
but represent specially defined subgroups within the 
popul-ati^on'. For instance r our of 29 ma^jor f lies, DecenniaT 
Census is the only survey based on a national sampl^; 
sampling- designs of f ive other surveys- are based on the 
total civilian, non-institutionalized population ot the 
Unrted^ States. Mos t of the Social Security Adminisitrat ion 
surveys are limited to program beneficiaries; other surveys 
.may only cover a limited age group,, or people with specific 
problems, or in specific settings. 

Three classes of problem confound the ready linkages of 
"these, files: (1) Divergent sources of data, (2) varying 
definitions of disability, and (3) inconsistent 
methodologies. Next, we explore the nature of each of these 
problems. 



Divergent Sources Of Data 

Demographic information on the handicapped population 
is gathered from four sources: service eligibility 
statistics, service delivery statistics, population surveys, 
and ad hoc studies. 

Service eligibility stati^stics . Service eligibility 
statistics are generated by applicants to agencies concerned 
with providing for handicapped and disabled persons. In 
order to have any data trace at all, the person must be 
broughc into contact with the agency through his or her own 
efforts, the agency's efforts, or the efforts of some third 
party. During the initial period of contact, the person's 
^characteristics that are of interest to the agency are 
recorded and a dec^ision made as to whether, the person is 
eligible to be a client. If the person is accepted as a^ 
clien^V.then a, complete set of personal data will be 
obtained.^ If the' person is rejected, then the data record 
may be lerfc^ incomplete . 

' . \ ' ' 

There areNiiany. problems inherent in ser\rii:is 'eli^ibi-H-ty- 
statistics. Firs^t^ a person applying to several agencies 
wi 11 generate several eligibility records and may thus 
appear more than, oncexin a file constructed by concatenating 
eligibility records. \Second, the quality of service 
eligibility statistics ^s^ffer from lack of standardi zat ion 
in both dec'^ision making and^^^cord keeping processes. For 
example the quality of the K5A-300 data collected by state 
agencies for state- federal \^programs of vocational 
rehabilitation varies greatly Srom state to state, and 
reduces the comparability of mergedNstate data within .the 
national RSA-BOO " systsm. WhileX service eligibility 



Toward A Comprehensive Data System 11 



statistics can provide much valuable information about sone 
characteristics of persons with disabilities, it is unlikely 
that such records ^ can provide national estimates of the 
incidence and prevalence of disabiLity. 

Service delivery data • Once a client has been 
enroidedT — -the^-^agency^-'must -ma-i-nta-i-n^ records on what services 
have been provided* Service delivery data present many 
problems for -making national, projections* The problem of 
overcount may be less significant *than is the case for 
eligibility statistics, but the quality of data suffers from 
Tack of starfdaFdilsatTon , and" the^popul^ation covered includes 
service beneficiaries only* No information is collected on 
others who may be qualified but do not participate in the 
program for- one reason or another, nor on others who are not 
eligible for such services. Examples of service, delivery 
data are those collected by the Social Security 
Administration and the VeteralT'"s^ Adman i^strat:i on . 

Sample surveys . A number of surveys sampling the 
U.S. population have collected data on disabilities, either 
directly or indirectly. The Decennial Census and the 
National ^ Health Interview Survey are examples of such 
efforts. .Most of these surveys present some problems. For 
example, the Decennial Census asks limited questions 
regarding disability. In the ,1980 Decennial Census, only 
two very broad questions were asked concerning people with 
"any activity restrictions" and public transportation 
limitations. The 1980 Po,st--Census Survey that would have 
provided a rich and accurate source of disability 
information was cancelled after pretesting. 

.Short of such targeted census-^type- sur^veys^ the cost^--of~ 
teasing disabilit y data from ot her n ational surveys may be 
prohibitive. The weekly NaTEional Health Interview Survey 
(HIS) is rich wi th . diagnostic information; however, unless 
cross-tabulated with various other questions concerning 
disability days, restricted-activi ty days, etc., such data 
might exaggerate the extent of ' disability within the 
U.S. population. The HIS was recently redesigned' in an 
effort to alleviate such over reporting. The new format 
includes detai led information on conditions resulting in 
activity restrictions and functional limitations. However, 
impending budget cuts may delay the implementation of this 
new format. 

In general, surveys that provide detailed diagnostic 
information on respondents (usually targeted health surveys) 
do not provide -adequate information on functional 
limitations or activity restrictions. Since the existence 
of a chronic . .condition cannot always ^e equated with 
disability, the' results of such surveys must be carefully 
interpreted. On the other hand, most of the national 
studies that ask for. major life activity restrictions or 
limitations do not attempt 'to determine the etiology of 
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disabling ,,conditions (see Table 1). As a consequence, the 
currently available national data sources provide either 
inaccurate or incomplete information on disabled persons. 

"^xAd hoc studies. In addition to these da'ta sources, 
there are occasional or ad hoc surveys, primary or secondary 
in nature, that are undertaken by demographers working in 
-^un r ve rs~i-t-i e s- — or~^na.txo_n a_l org an izations, particularly those 
representing various- disability~grWpls~("SUch-as- -the--H.aJ:j5na^l^ 

Association Jor the Deaf). There are also a rich array of 

state and local studies^ wh'ich*, a-Lthbugh .not sampling 

national populations, provide descriptive informatiort on 

Wrrous — aspects — -of^^disabllitXr^ such^ as attitudes of others 
toward people with disabilities and proble^Tis — of- disabled^ 
people related to housing, transportation, recreation, etc. 
(See Table 2 for example of such studies.) 

Since most of these ad hoc studies are not based on 
national samples, they cannot be used for enumerative 
purposes but do provide some useful descriptive information 
-on the characteris^tics and experiences of people with 
disabilities . 

o 

' ' Varying Definitions Of "Disability" And "Handicap" 

A consequence of the programmatic origin of most data 
on impairments and disabling conditions is that def ini^tional 
issues tend to be resolved at the stage in which a person's 
eligibility for participation in a program is determined. 
Agencies providing ^services to the disabled quite 
.legitimately pursue the legislative intent of their special 
mandate. Consider three definitions written into law: 



' 1. The Social Security Act defines disability as "the 
inability to do any gainful activity by reason of any 
medically determinable physical or mental impairment 
which can be expected to last for o continuous period of 
notj i^ess than 12 months. To meet this definition, you' 
must have a severe impairment, which makes you unable to 
do previous work or ^any other, substantial gainful 
activity which* exists, within the national economy." 
(Section 404.1505) Further, applicable regulations 
stipulate: "If you are working and the work, you are 
doing is substantial gainful activity, we will find that 
yoii are not disabled rega'rdless of your .medical 
condition or your age, education, and work experfenv^e . " 

2, The "504" regulations of the Department of Education and 
of the Department of Health and' * Human Services 
pertaining to civil . rights matters stipulate: 
"'Handicapped persons' means any person who (i) has a 
physical or mental impairment which substantially limits 
one or more major life activities, (ii) has a record o.f 
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such impairment; or (iii) is* regarded as having such an 
impairment." \\n the regulations operationalizing this 
definition, an impairment " is * defined as: "any 
physiological ^disorder of condition, cosmetic 
disfigurement, or ^anatomical loss a^ffecting one or more 
• body systems.'' (CFR, Vol. 45^ No. 92, p. 30937) 
Further, "has a record of such an impairment" can refer^ 
either to having a history Of substantial limitation or 
having been "misclassif ied^ as having a rn^ntaL or 
/ * physical impairment that substantially limits >^ne or 
more life activities." And, "is regarded as ha\ing an 
impairment" can refer to an impairment "which does not 
ni^E~aG-t,..._substantially limit major life activities, but 
that- is treated^ by a (government) recipient as 
const ituting^ such^a^Hmitation . " 

"^3T-The_ definition of the ^^Rehabilitation Services 
AdminTsSratipn views^ a "handicapped individual" as 
someone« who "(i) has a physical cr mental disability 
which for such an individual constitutes or rfesults in a. 
substantial, handicap^ to employment and ^-(ii) can 
reasonably be expected to benefit an terms oiE 
employability from vocational services." 

The RSA regulations add the statement "or for whom an 
extended evaluation of rehabilitation % potential is 
necessary for the purpose of determining whether he 
mi'ght » benefit • in terms of employability from the 
provision of vocational rehabilitation services." • The 
vocational rehabilitation definition emphasizes the_ 
rehabilitative and employment potential of the 
handicapped person, and excludes: those who do not show 
such potential. 

Individual disposition and attitude are obviously critical 
"in this determination. ' Additionally, the final decision 
regarding eligibility resides with the State i^otational 
rehabilitation counselors. \ 

Thus according to the Social Security Administration, 
one is disabled only if gainful employment has been 
precluded for a year by an impairment. The Rehabilitation 
Services Administration's definition factors in a person* s 
disposition and moti^>ation in determining reoson^ble 
expectations of the person's employability potential as a- 
result of participation in vocational rehabilitation 
program. The Department of. Education and the Department of 
H6alth and Human Services have the most liberal definition 
for civil rights purposes: Any physical or mental impairment 
substantially limiting "major life activities" is consider^ed 
as a handicap. Further, section 504 of the Rehabilitation 
Act includes not only those now restricted by an impairment, 
but those .who may be restricted by having had or having been 
regarded as having such an impairment. Haber states that 
rsince the language of the legislation goes beyond work and 
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VR services to include a broad array of social seirvrces, the 
population covered against potential discrimination may 
include infants, students, and the elderly, as well ^s those 
whose handicaps preclude them from employment,''^* Indeed, 
the regulations are unequivocal in^ including these age 
groups.^ 

The practical consequences are twofold: different kinds 
of people fall within the administrative record systems pf 
these agencies, and the kinds of information recorded about 
each person differ according to the agency's mission. Thus, 
while the administrative f iles of the vocational 
rehabilitation agencies do record the nature of the client's 
specific impairment (e.g., hearing loss), they will not 
provide a true demographic .record of the incidence and 
prevalence of. disabling conditions, as both most ^nd least 
severe instances of ^ handicaps are excluded from 
consideration.^ . 



Inconsistent Methodologies " ^ 

Definitional questions almost always have implications, 
for how measurements are made. At least three facets of the 
problem should* be noted:- what variables are measured; ^ what 
questions are^ asked or criteria invoked; what units pf 
measurement are' employ e.d? Some surveys rely on existence of 
a condition (based on diagnostic categories and/or 
impairments) and evidence^ of limitation in life activities. 
Others concentrate on limitations without identifying the 
specific physical or mental condition or conditions leading 
to disability. Thus, it is ^difficult to obtain accurate 
national .estAmates on* the numbers of disabled people who 
have specific handicapping conditions or impairments. It* is 
easier to estimate the number of people who have work 
limitations or other limitations. , However, limitations are 
not always identified in a consistent manner. Some surveys 
simply ask: "Does your condition limit the amount or type of 
activity in which you engage?" Others ask detailed 
questions on limitations affecting areas ranging from 
transportation to daily living nee(3s, recreation, etc. 
Further, surveys differ in their coverage of reasons or 
conditions that result in work limitations or , other activity 
restrictions. Such variations make any aggregation or 
comparison ofv results very difficult. 



Lawrence D. Haber. "Disability Concepts: Implications 
for Program and Policy Development" in Definitions of 
Pathology, Impairment, Functional Limitation, Disability . 
Mary E. Switzer Memorial •* Series No.l. Washington, D.C., 
National Rehabilitation Procedures, 1980^ p. 24. 
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pther methodolocfical differences between surveys, ^such 
as variations in ^ demographic question's/ also make 
aggregation difficult* Some surveys ask for familjf Or* 
household income, while others inquire about personal 
income. Educational levels are reported in a ^ variety of^ 
ways, which are not always -compatible across files. As a 
result, there are serio.us gaps in ' our knowledge of 
disability and in our ability to make projections for 
handicapping conditions. 

^ vl^nf ormation Needs 
Information needs of the NIHR ihclide data pnr 



a. The prevalence and inciden-/e of handic^apping conditions', 
including background and identifying information, 

4 ' - 

b. Patterns of need and problem identification, 

\ ' "* 

c. Residual capabilities , and strengths of the disabled, 
particularly in relation to vocational and social 
pdtentia-k5 and aspirations^ 

d. Experience. of disabled clients with service 
orga'nizations, problems, and needs concerning 
independent living, housing, work etc'.' 

' Minimally, background data on disabled persons . should", 
include -information - on geographic location, age, 
race/ethnicity,' and family income. Knowledge of these 
ch^rcCterist ics is essential to eri understanding of- 
handicapping conditions,, to needs assessment, and to the 
development of appropriate programs. Knowledge about the 
location of affected persons indicates where and through 
what political channels resources will be allocated. 
Information on the^age and income of affected persons gives 
an idea of the length of time resources will be consumed in 
their support and on the degree to which disabled .persons 
can contribute financially to their own relief. Such 
information is also useful in as'sessirfg the overall economic 
consequences of disability and the ' potential costs^ and 
benefits averting it. Information on race and ethnicity 
.addresses questions concerning the social equity of the 
-distribution of government aid Lo the disabled. Such 
information can also pinpoint specific Cjoilnerability , ^ such 
as the higher susceptibility of blac*ks to high blood 
pressure, and of fair-skinned persons to skin cancer. 

One of the paradoxes of data for national decision- 
making is that the data of interest to those with relatively 
broad responsibilities are often collected and maintained by 
those with specific programmatic respcrfsibiii ties. The 
situation r<>sul.ts in three types— of information gaps for 

L 
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policy makers and researchers: Gaps in population covered; 
gaps in information: covered, and gaps ^ in disability 
— ^c'a't'egories cbVerid. The last is .perhaps the most serious. 

arising from the lacJc^of_ cp.nsistent and systematic use 
^deiinitional'-t'e'rms^ and assessment methods. 

« Gaps In Population Covered _ 

There are serious ..gaps in the population of disabled 
people cover.ed^by many of the national surveys. A truly 
comprehensive data system on handicapped people requires^" 
. information on the full range of, disability for people of 
all ages.. A major, information gap concerns those people 
with disabilities^ who- do not fall within ^ the domain of 
various, programs, .either because they fail to meet program 
eligibility criteria, dp not^know^ about\ the programs, or 
despi'te^^ their limitations and disadvantaged situation, do 
not consider themselves handicapped and; in need .of services. 

' Another information gap concerns the elderly and the 
very ^ young. Much of the data on the elderly come from 
surveys of institutionalized .persons. Since, most elderly 
peoplfe ' are not institutionalized,, but livg"*by themselves or 
with fa^ly *memb.ers, such surveys pro^1xte:;j:X)nly limited' $nd 
\ biased" rnform'ation about the number of di|afciled.elderly "and 
^conditions they^face. Similarly, surveys on\\tlTg^very young' 
. ^tend to focus on restricted populations, .si^ asV^hose with 
^ congenital defects detectable at birth or thoj^e-wit^h hearing 
•impairments. In recent years, much additional information 
has been developed on handicapped children of school age^^ut 
the classifications are usually stated in educational terms^ 
and are not well correlated with etiology. More information 
is needed on the children and youth of" the nation and on- 
what constitutes handicapping conditions at various ages. 

Gaps - In Information Covered 

With^ minor exceptipns, : none of the national surveys 
provide information on residual capacities and coping 
- mechanisms,' heeds, or service experiences of handicapped 
people—all. priority areas ais st^ajbed in the NIHR grant 
announcement. Some notable, exceptions are the National 
Survey of Transportation of Handicapped People, sponsored by 
the Department of Transportation; the iSheltered Workshop 
'Study,.; ^sponsored by ' the Department of Labor; and the 
Comprehiensive ServTC^er^-Jleed^S" Study, sponsored by the 
''Department of Education.. < 
«f . ' . . 

/ Various local studies also provide descriptive 
ihf ormat ion :* e .g. , the Kentucky Assessment of Vocational 
Education Needs of the Disadvantaged and^Handicapped , the 
, Wisconsin Survey of Paratransit System Serving the Elderly 
. • ^#nc( Handicapped, and the Assessment of the Unmet Needs of 
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the Aged Blind Population in New Jersey. Table 2 presents 
example of such state and local studies. 

It is also, difficult to aggregate the results of these 
studies, as each varies in the ways in which essential 
demographic information jucji^ as .age or income, is asked. 
.Finally-,- -very; few nari"on^l studies collect information on 
etiology, age on onset, and secondary conditions, although 
such information is essential to understanding and tracing 
the dynamic nature of handicapping conditions. 



Gaps In Disability Categories Covered 

The multiplicity of classif icatory schemes and the lack 
^of ^ standardized c'riteria for the assessment' of disability, 
coupled with the programmatic focus of many national data- 
collection- agencies* have resulted in data sources that are 
inconsijstent , incomparable, and riddled with problems.. 
While ' some surveys provide information on diagnostic 
categories, with insufficient ref erence" to any accompanying 
activity restrictions, others concentrate on functional 
limitations with insufficient information on the etiology of 
such- conditions. ,Currently, no national studies provide - 
information on all conditions, and on the type and extent of 
functional limitation that ^ such conditions impose. ^ An 
excellent source for sucS information is a well-designed 
census or national sample survey. The population of 
, interest in gathering basic data on the incidence and 
'prevalence bf handicapping condi'tions is thBt ^of all persons 
living in the United States. It is known that the, kinds and 
degrees of disability yary with geographic location. It is 
apparent that reliable results require, at a minimum, ^ a' 
sophisticated sampling design us^ing samples of adequate size 
and a cadre of "well-trained interviewers. Such efforts are 
costly. If they are to be cost-effective, such surveys must 
be plaiined with a view'to the multiple use of their results. 
Such filanning requires a national perspective and the kind 
of consideration, that only national authority can provide. 
Unfortunately, no agency concerned with the^ handicapped, 
including the NIHR, has any mandate to survey this 
population. It is important to realize that NIHR^s mandate 
to produce reports in conjunction ,with other agencies that 
routinely collect statistical data is not the same as having 
the mandate to survey the disabled population of the United 
States. ' . • 

The definition of ^^handicapped" is currently tied' to 
eligibility ci:iteria.. It is defined in terms of one*s^ 
entitlemeivt to a program or service. Yet disability should 
be approached as a real personal characteristic of human 
performance and activity/ and .as such, should serve as an 
important indicator of the health status p£ the nation. The 
process of ^idencifying handicapped persons and handicapping 
conditions' shoulo^^ be separated from the process of 
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_ EXAMf US 0F ,STATE^t.,LOCAUST0DIES.W|THJNF0RMTI0N.0N.THE.MANOICAP^ 



Prevalence t Incidence Kales 
<> Characteristics 


Needs 


Aspirations 


Service .Experiences 


A 1 1 i t udes^ f ^0 t!>e r s 


deg'stry of 0e«r.8lind Children (AK) 
CES-::2Hqs Client InfonMtlon Syste* 

in) 

Crlr^Ied Children's Services Prooran. 

{-J . 

CatiVornia OisabilKy Survey (CA^ 

Case ^e^orting of KH(0) H««olytlc 
Cesease (CA) 

^ttc't'trj System^for Handicapped 
S:«-':ents (CO) 

Cey«2s:»'^ntal Disabilities Infoma- 
ton Systen (IL) 

S^r.e/ of Severely Disabled In Iowa 

Lo««s«a''> Survey of Hearing Irpaired 
a-2 Cca'-31ind Cri Idren^and Youth 

'•assachcscCts Hental Retardation 
Srr/;^«: C'X/rdinac ion Battery (nA) 

Ser/tces fof Children and Handicaps 
i'i^^'i^-i'it lnfor.--»tion Systen (KM) 

S'-r.e/ 0^ Sensory, ir>paired Chi Idren 

Ce<e*C3'«'*tai Disabilities Informa* 

1377 Tele^Hcte Sur/e/ on Developmental 
lui^: iiti^s (CH) 

•37—76 foMow-ua of the Physically 

''ii^iicwt^ (PA) 

^'tt^e'^t-Ciient Information Systen (PA) 

2ro^n university Health Surveillance 
Sc^riies ih) 

Hinc«C4?Pfd (Children) Information 
Ce-ter, (TX) 


Services for Handicapped 
Children and Youth (lA) 

Adult Restorative Services (KS) 


Postsecondary Educational Status 
* of Physically Disabled Adultf 
(PA) 


Hot Springs Rehabilitation 
Center Follo»«*up of Former 
Rehabilitation Clients (AR) 

UCIA Intervention Program for 
Developmentelly Handicapped 
Infants Young Children (CA) 

Household Survey on Effective* ^ 
ness of HeelUi Services (TN) 

" R5Ar300 Series 


Pennhurst Telephone Survey on the 

At tl tudes-Towar4-the>^entel ly*j — 
. Retarded (PA) ! 


Kentucky Assessment of Voca* 
tional Education Needs of the 
Disabled and Handicapped (KY) 

1976 Needs Assessment Study 
(Hass. Commission for the 
Slind - KA) 

Hinnesote Disability Assessment 
(«N) 

Survey of Handicapoed Indivi- 
dual's Housing Preferences 

Survey of Paratransit Systems 
Serving the Elderly and 
Handicapped (tfl) 

Assessment of Vocational Needs 
of Handicapped Students (Wl)' 


Gecontology Retlreieent Study 
(CA) 

The Job Hunt of the Disabled 

(HY) 


Survey of Children's Attitudes 
Toward Handicapped Peers (h|) 

Cook. County Survey of .Teacher 

Attitudes Toward Handicapped 
Children (IC) 

Survey of Landlord Attitudes 
Toward Housing for Hentally 
Retarded Persons (NYC) 


Western Arkansas Study on Aging 
(AR) 

Aging Needs Assessoent (CA) 

Transportation Needs of the 
Mobility United (ILL) 

Healthy Income & Related Prob- 
lems of the Elderly (KT) 

Howard County Handicapped^ 
Needs Assessment (HO) 

Transportation Needs of the 
Handicapped (Boston, HA) 

The Elderly In th% Inner City 
(MYC) 


Sai 5ieco County Teratogen Registry (CA) 

»*e<ic«l and Allied Health Services 
Oeli/er/ Systen for Substantially * 
Ha'^i cooped Developmental ly Disabled 
Aivlts (Chicago - II) 

Pcc^'ester Project (Wi) 

1979 Surrey of Disability (Houston, TX) 

Statistics on Spinal Cord Injury 
Pa:Ients (tx) 



*Ho date sets identified discrlbing residual capacities coping mechanises, 

• 30 
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determining service eligibility. If disability measures are 
to be used as national indicators of health ^ status, such 
measures must be , uniform in definition and measurement. 
Data collection activities should be- based on census-type 
national samples. 

The 1980 Post-Census Disability Survey was designed 
with these-^considerations in mind. It has been pretested 
and ' offered great promise of producing the kind'- of 
demographic master file that could be employed in addressing 
general questions of incidence and prevalence. The Post-" 
Census Disability SurVey would also have served to answer 
many specif ic - inquiries regarding the attributes of those 
with particular^ impairments. For budgetary reasons, 
however, the Survey-was. not conducted. Additionally, in 
early 1981, eight* other surveys were either in the planning 
or developmental stage. At the time of '^this writing the 
surveys and' th.eir statuses are: 

!• The Survey of Impaired Individuals in Households (DHHS), 
reduced in scope; 

2. The Medical Examination Study of Extent of Impairment 
and Potential SSDI Eligibles (SSA), funded for 1982; . 

3. The Work Incentive Experiments (ORS/SSA) , cut down and 
to,, be done in-house; 

4.. The Framingham^ Study Disability Analysis (ORS/SSA), to 
consist primarily of data archiving; 

5. Alt'ernative Practices in Disability Beneficiary 
Rehabilitation (SSA)> cancelled; 

6. The Survey of Income and Program Participation 
(ORS/SSA) , cancelled;^ 

7. The Survey of Individuals in Residential Institutions 
(DHHS), cancelled; 

8. A Feasibility Assessment for a Longitudinal Survey of 
the Disabled (SSA), cancelled. 

9. The Survey of Impaired Individuals in Households (DHHS), 
cancelled. 

Further, the newly reformatted National Health 
Interview Survey (HIS) might also not be implemented. The 
new format is designed to .provide much needed detailed 
information on activity restrictions* and functional 
limitations of various impairments. It also places a 
greater emphaS^is than before on determining disability in 
those 60 or over and those ,bet*een the ages of 5 and 17, 
both of which are relatively under-surveyed populations. 
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Howe^r,^ budget cuts threaten HIS as well as other well- 
est^lished national data systems, such, as RSA-'SOO. 

This section discussed the problems associated with 
cur^n.tly available data sources on disabled persons. 
Ideally, the most accurate source of national information, on 
disabled persons and on the incidence and prevalence rates 
can te obtained from a census-type survey targeted to 
idenrtif ying., and describing, disability. Since there are no 
siicn survey's and the likelihood of undertaking one in the 
near future* is almost nil, we now turn our attention to 
teasing estimates and demographic profiles of disabili*"y 
from existing data sources. 



J 



\ 
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III. TECHNIQUES FOR THE USE OF EXISTING DATA 

Throughout the course of this project, we have been 
driv.en by three objectives: identification and description 
of data files, ' consideration of how best to estimate 
incidence and prevalence,, and design ^ of a "comprehensive 
data system." The tas^ of- actually estimating incidence and 
prevalence was beyond the. scope of our project. We have, 
however, outlined the procedures that could be employed in 
such an exercise. 

General Guidelines 

Two general rules should be followed while attempting 
to. make national projections from available datia sources on 
the disalDled'r- First, the focus of inquiry must be well 
formulated; second, acceptable levels of confidence must be 
precisely stated. • . 

Narrowing The Focus 

To achieve an economy of effort in^pursuit. of estimates 
of incidence and prevalence, the focus of the inquiry must 
be precisely formulated. Not only is there a diversity of 
the populations s.ubsumed in various data fi:les^ hut data 
collect ibn^ efforts have rested on varying definitions of 
what conditions should 'be included and how they are 
described* 

A machine-readable data file's content can be described 
in terms similar to those describing any other information 
medium. In the case of numeric data files,, however, the 
description of content is usually more specific than is the 
•case for print materials. A machine-readable data file 
either does or does npt contain income figures f orresidents 
of Nevada. It either does or does not contain pe.ople's age. 
The file may contain information which could be used to 
estimate such figures,, or may contain keys which would allow 
.it to be merged with ' a file containing such figures. * A 
machine-readable data file, however,* cannot allude tOt, 
imply, or mjBtaphorically describe things. Thus, a machine- 
readable data file is usually either highly relevant _ or 
completely irrelevant, to answering a particular question. 
Such specificity in a numeric file als.o implies that a 
description of the file's content is unequivocal. The 
variables, their -unit of measure, and the population which 
they describe are as they are,, regardless of the purposes 
for which the i?nf ormation is used.'^ ' 
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In attempting to answer the question about the 
incidence and prevalence of disability, data Tiles of 
interest are those of national scope and have information on 
disabling conditions as well as geographic location, age, 
income, and race of respondent. Given the definitional 
problems, it is likely , that-, multiple definitions of 
disability will . be required, and that the definitions of 
interest to the secondary users *are not necessarily those .of 
interest to the data's producer. 

The initial question to be asked about each source of 
data is its relevance to the question. we want to ask and the 
population about which we want to ask the question. We must 
know five things about each source of data: 



l; What is its domain of discourse? e.g. 
a .variable which indicates disability? 
information on .income? 



r does it contain 
Does it contain 



2. How is information * of interest expressed? In what units 
of measure? 

;3. What populations can be estimated from the available 
.information? e.g., can we use a file based on a 
.national sample to estimate disabling conditions in a 
certain region or disability in people of a certain age 
range?', , > • * ^ " 

4. What results ate obtaine'd from* the file? What does it 
" say about disabilities in the population-of* interest to; 

us? ' , 

5. What confidence*- can we express in these results? To 
what degree are they uncertain? To what degree are they 
biased? 




Statistical Goals / ^ 

In addition to errors of coverage, definition, and data 
'collection technique , we must attempt to cope with two 
distinctly statistical problems, uncertainty and bias . 
Given the present state ot data on the disabled, it does not 
seem wise to make our primary goal, that of maximizing the 
precision of estimates. At this stage, we should try to 
minimize the uncertainty of our estimates and to attempt to^ 
correct for bias. 



Uncertainty is ** the more tractable problem. Suppose 
that it is estimated that 30,000 people have a particular 
disabling condition. In fact, this number is not meant to 
be exact, but is the best estimate of the actual number. If 
pressed m^ore;^felosely , we would say that althdagK 30,000 is 
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our best guess, ;we are 90 percent ,sure f-hat the actual 
number of people vi'th^the condition is between 29,500 and 
•30, 500* If the data were unbiased; but had a lower degree o"f 
certainty, . we would ' stili specify^ that our best guesb was 
30,000* With the less certain data:, we might state that we 
were 90 percent sure that .there were between 29,000 and 
31,00.0 affected people* ^ Thus, minimiziag uncertainty 
decreases the size of the "confidence- interval" which 
surrounds a statistical statement* 

If the data were biased, rather than uncertain, then 
our best guess would djeviate from SOir.OOO, even though the 
confidence interval around that estimate might be unchanged* 
Bias might occur because some data are omitted in an. unknown 
but systematic fashion* Errors might be introduced as a 
result of iaterview* bias" or s.elf -report ing bias* 'While, 
uncertainty^produces fuzzy results, bias produces incorrect 
results* Our task is to state the degree of uncertainty 
surrounding our results and to indicate, it possible, the 
magnitude and dlrection^ of any bias* 

Finally, we must give some idea of the sensitivity of 
our., conclusions to our assumptions* Suppose that there were 
several common definitions ' of our hypothetical disabling 
condition* It is statistically prudent to do estimates of 
prevalence using more than one definition of the condition* 
If the estimates varied by only a few percentage points^ 
then we' could- describe our .estimate of prevalence as 
"robust." If, however, prevalence under one definition 
varied "greatly" from prevalence under the other definition, 
then the statistical presentation would require more 
qualification* ' - . 

;The most salient finding from- our review of current 
estimates of the disabled population of the United States 
was the wide variation ' in such estimates* It is not 
surprising that the estimates vary widely, since they are 
not based on comparable assumptions* Our conclusion is that 
a satisfactory estimate, of the number of disabled persons in 
the United States can be performed only if" the following 
conditions are satisfied: 

1* The data are from a survey which yields unbiased 
estimates of the U*S* population; * 

2» The survey records information on impairments,, 
functional limitations, and vocational an'd activity 
limitations; 

3* That estimates of the number of disabled persons be with 
respect to specific definitions of disability; 
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4v That analyses be based on microdata^* rather than on 
tables or aggregates. " 



Techniques For Estimation 

A number of techniques may prove useful for extending 
the utility of available .data/ The efficacy of a particular 
technique must be assessed in terms of the particuJ.ar 
question at issue^ the data available^ the skill of the 
analyst in making the estimate, and the skill of the client 
in interpreting- and using it. The following discussion of 
techniques is thi'S neither authoritative nor exhaustive. 



Imputation On A National Sample 

^ It is sometimes possible to infer information where it 
is missing from a record, or even from a file. For example, 
suppose we were estimating the number of people suffering 
from each of two disabling conditions. Suppose further that 
the best available data were in two files,- one for each 
condition and that each survey asked . about only one 
disabling conditipn. One problem is that persons suffering 
from both conditions would be counted, in each file, but that 
tlje double counting would not be directly measurable. One 
way of correcting such double counting would be to locate, 
studies of the co-occurrence of the two conditions by age, 
sex, occupation,; etc. The res^ults of such studies could be 
used to correct the overall estimates by adjusting the 
counts for pei^sons by age, sex, occupation, etc. Similar 
"imputation" techniques used within a file can sometimes 
overcome problems of missing data. 



Adjustment Of . Sample Weights 

It may sometimes be possible to yse files basea on ^one 
population to derive estimates for a'Yiother population, when 
\the differences in t^ie two populations' composition are 
knbwp. By^ the use of such weighting, \ fil^ of microda'ta 
with one ^age distribution might be , used to ^derive estimates 
for a .population' with a different age distribution.^ Records 
of people with under-represented ages would Be multiplifed by 
a .weight factor to increase their representatNi^on. ' Records 

— -■ \ 

^* Microdata are records' based on individual ^units of 
observation. A microdata record might be ba^ed on a p^ersori, 
case, service contact-, etc." Conversely, an aggregate ^data 
record- always contains' a count of the nujn^er of units 
described by the record. Microdata records have an implic^it 
.count of one.. ' \ 
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of people with an over-represented age would be weighted so 

as to decrease their contribution to the estimate. The^ 

efficacy of such weighting depends on our knowledge of the 

differences in the. compositions of the populations and our 
confidence that other sampling, conditions are "equal."" 



Aggregation And Merging 

.Very often files which cannot* be merged on a record-by- 
record basis at one level of aggregation can be merged at 
some higher jlevel of aggregation. For example, it may not 
be possible to match records between two files, each of 
which contain 10,000 records of individuals. However, if 
each 'person ' s record contained a county code, it would be 
possible to produce an aggregate file containing up to 3,042 
records. Such records would show the number of persons in 
each county having given characteristics. -In this case, 
3^042, the number of U. S. counties is a -"greatest common 
divisor" between the two 10 , OOO-record files. 

-"The trouble with such aggregation and merging is that 
the "greatest common divisor" can become arbitrarily small. 
li individual records it\ one of the two hypothetical files 
are identified by state rather than by county, then ,the 
greatest common divisor becomes the number of states rather 
.than the number of counties. However, even- very .small 
^merged files may be of interest in. answering particular 
questions. 

It is unlikely that the use of several files to make 
national estimates would involve any record matching. Such 
matching is appropriate only when two files contain data for 
the same set of persons. the most li kely processing 
technique in such a case is to construct a file containing 
records derived from each of . the^ original files. The 
concatenated file is then used as a p'Qoled sample. Such a 
procedure *. requires comparability by variable and 
comparabili t*y by value, but not comparability by 
observation. (See Appendix A.) Thus, for the construction 
of national estimates, the issue of record linkages between 
•files is moot. 



; £-tatist ical Matching 

Most record-by-record linkages between machine-readable 
data files are done on the basis of an exact match. Records 
are conside^red linked onlj^ if the values on key variables on 
one set o^f\records matches that of key variables on the 
second file. For-^ejcamp'l'e-,^ the RSA-SSA data link is matched 
on peoples' Social 'Security ^numbers. 

Statistical matchirigi is sometimes used where it is not 
possible to obtain a match key such as^ the Social Security 
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number. In statistical matching, the record keys are 
attributes of the statistical population- which will make 
records from one file "equivalent to those in the other 
file.'' Thus, microdata from two files might 'be matched on 
the basis of age, sex, 'race, and income in the hope that 
other information in the two fileg would be matched 
appropriately. The resulting file might be used to match, 
for example, a geographic location variable from one file 
^¥itl^^ an occupation code from the other file. In general, 
statistical matching is^ difficult and is not nearly as 
powerful as exact-match procedures.^' 



Linkage Within Tables 
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At the highest level of aggregaidon , it may be 
necessary to link files by entering their distributions and 
summary figures into multi-dimensipnal tables. In its most 
extreme form, where there are no redundancies in the data, 
each of the cells in a table may be a summary figiire from a 
separate file. Where^^several files contain data about the 
same phenomenon, it is possible to construct estimates 
through a number of table-fitting procedures. In this case^ 
bivariate frequency tables are constructed from data in each 
of the two files and a set of fitting procedures used to 
make for the greatest degree of fit between the two sets 6fr^ 
data. The project .on- Interagency Da.ta Linkages has produced 
several working papers on the construction and fitting' of- 
such tables. This jiroject represents the highest possible 
degree of organization • and elegance , in the matching of 
records , from diverse agencies. It should serve as a 
standard and a model of efforts to merge data files. ! 

i — 

The Interagency Data Linkage project in 1963 was 
established by the Bureau of the Census, the Social Security 
Administration, and the internal revenue . service. . VThfe"^ 
purpose of the project was fq develop files of micrqdata, 
containing information from files held by "-all three 
agencies, , but -devoid of personal identifiers. Such -tiles,, 
containing information on demographics, income, and earnings 
history, would be of immense value^ to researchers and 
.policy-makers. The project begins by. sampling the Census 
/Bureau's Current Population Survey and linking CPS records 
with tax return records and. Social Security earning^s records 
for the sampled persons . -In order to do the Ifnatching, 
selected personnel from et?h agency were sworn in as -special 
agents of the other agency. The proj'^ict produced a series 
of technical and substantive repo*:ts on the- matching 

RepoVt on. Exact and Statistical Matching Techniques, 
Statistical Policy Working Paper 5, U.S. Department of- 
Commerce, Office of Federal Stati§4:ical Policy ^nd 
Standards, 1980. ' ' 
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process, techniques for performing and evaluating matching, 

and on the substantive results, obtained from suclT 
matching. ^ • 



Characteristics Of Data For National. FIstimates 

If any reasonable national estimate is to be derived, 
then we must begin with a sample estimating a national 
population.. Many of, the high quality and complete files, 
such as the RSA-SSA* data link*, are of no use in preparing 
national estimates because they do not . have any intrinsic 

. relation to the national population. At best, the RSA-SSA 
data link is a self-representing sample of people who are 

'clients of both agencies. While it- is possible to 
approximate a national population, estimate by adjusting the 
weights of a non-national s&mpie, such efforts suffer from 
three defects. ,The first -is that such adjustments will 
probably add unacceptable bias to an alrear^y difficult 
ejnterprise. A second^problem is one of sensitivity with 
nespect to adjustments in sampling weight, which can only 
nave a malignant affect, on estimates of underrepresented 
segments of the national population. Finally, national 
Estimates derived from non-national samples are hot likely 
to be credible to a sophisticated audience, especially one 
with a political agenda of its own. A lack of robustness 
due to bia?-^-^. sampling is easy to discover, explain to 
others, and refute with statistics , drawn from a more 
congenially weighted sample. 

/ Measurement, Pro'blJms* 

' , J 

Nagp^* haS' discussed, sources of error > in the 
measurement of ^ .disability . All of the usual problems of 
reliability, validity, comparability, and bias ar,e present 
in survey data.. Obviously, the designers of surveys do 
their best to minimize their measurement error. However, 



Fritz Scheuren, H. Lobk Oh and Wendy Alvee, Matching 
Administrative and . Survey Information: ' Procedures and 
Result's of the 1963 Pilot Li1Tk Study, Report No. 3, Studies 
From Interagency ^Data Linka'ges, ;June 1978. >* 

Joseph Steinberg., Some Observations on Linkage of 
Survey . and Administrative Record . Data y Studies from 
Interagency Data ^ Linka'^ges, U.S^ jDepartment of Health, 
Education', and Welfare, Social- Security Administration, 
Office of Re'iearch and. Statistics., August 1973. 

" . Saad Z. Nagi Disabiliry in the United Stated; - A Plaii for 
an . Information System prepared for t.he^Offic.e of Handicapped 
I-ndi vidua Is, DHEW, October. 1977. 
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the definitional complex'ijty of disability requires that the 
arialyst be in a position to Second guess the ijitentions of 
the survey, adjusting category bounds arid relationships as* 
needed. The only level on whi.ch such adjustments cnn be 
made is .that of the, individual data record* Thus any* 
adjustment for measurement biases requires that the original 
microdata be used.^*^ 

Definition And Sensitivity 

Disability' estimates must be made for particular, 
definitions of disabili-ty. Such definitions are constructed 
by manipulating and con\bining-. variables in microdata 
records. i*he robustness , of a' national estimate can he 
tested by noting the affect of "slight" .perturbations .in the 
definition/ What is the effect on^the prevalence estimate 
when a sliqhtly more or Less restrictive, definition is used? 

In summary,^ the problems of making 'national estimates 
of disability seem more inherent in the specification' of 
definitions than in a lack of available data*. However, the 
process of projecting national estimates from available/data 
,sourpes' is not an easy one. It requires the assemblifVg. of 
data sources f rom * public and private organ iza't ions arid 
workijig out problems associated with confidentiality and 
proprietary. It will require on-hand experience and much 
experimentation. The cost-effectiveness of such an exercise 
will depend on such factors as the level of conf idence ' to be 
tolerated for the estimates and the generality (or 
specificity) with which disability is defined. 

The next section presents 'our recommendations tO; NIHR, 
including the pretesting and eventual* establishment of an 
archiving and analytic system ihat will have- the capability 
for maJcing such national projections. - ' . . 



Aggregate data can be used in the special circumstance 
- whe're the aggregation results in no loss of information. 
For example, a microdata file containing only age and sex 
variables can be aggregated to *a; file of records\giving the 
count for each combination of age and sex. ThjB aggregate 
file contains . exactly these same information as the micrc^ 
data file. ' " \^ 
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IV, .TOWARD A COMPREHENSIVE DATA SYSTEM 

The objective of this planning- project was to assess 
the feasibility of the establishment . of a comprehensive data 
system on the handicapped people ' of the nation. Our 
investigation showed that a comprehensive data system, in 
the sense of one integrated fil?, was not feasible with 
currently available data sources. Instead, we recommend the 
testing and eventual establishment of a more flexible system 
with analytic arid archival, capabilities- that could imfirove 
and build' Upon ^existing^^Qta' sources^ " ' 

This. sect iop inciudes our recommendations as well as a 
drscussion^ of 'the - statistic.al policies that NIHR should 
de'vel'op. We describe .steps MhR should take to achieve its 
^cpngressional 'mahdate'* pt^ improving disability statistics 
through the power of- coordination given to the Director of 
NIHR in^ the Interagency Committee on Handicapped Research. 

' ■ ' ■ . ■• ■ - / 

Recoirimen'dat i oris / , 

* The mission ^ of NIHR is to .carry out the Congressional 
mandate to/'promote .and coordinate research with respect to 
handicapped individuals" (P.L. 95-602). In interpreting 
this . iegislatiye mandate, the National Council on the 
Handicapped (NIHR's jpoi icy -^making body), envisioned NIHR as 
spearheading joint sponsorship of re^search activities by 
"coordinating related ^federal programs of research** and by 
'•producing, in concei't with 'other federal agencies, 
demographic and pther statistical information^ relative to 
handicapped individuals and disseminatinq ' ^ such 
information.^^ In addition-,. .^NIHR is expected to focus on 
research areas unsupported or ."npt adequately covered in- the 
missions of other'federal agencies. " / 

' The function of coordination is an important one, and 
give's NIHR the authority to addrjsss definitional and 
assessment issues essential to the improvement and 
establishment of a data collection system on- th.e disabled of 
the nation. The major problem with existing data sources is 
that handicap is not. uniformly defir\ed,^ but/ is measured in 
terms of entitlement to programs or services; Disability or 
handicap is a phenomenon that affects a largjb proportion of 



National Council on the Handicapped, "Pplicies Governing 
the National Institute of Handicapped Research, Review 
Draft ^dated May 5, 19.81. . . j 

Ibid , Policy No. 9, p. 6.' 
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the population, some of whom may not be eligible for, kr\ow 
of,, or choose to participate in, such programs. The 
definition* and determination of disability should be freed 
trorn programmatic concerns of eligibility and handicap and 
tied, to the functional Assessment of ability to carry out- a 
major life activity. NIHR is the only federal agency with a 
broad .and non-programmatic responsibility in this area. 
NIHR should^ therefore promote the concept that disability is 
an\' indicator of the health status of the nation and as. such, 
it should be part of any national survey, whether of general 
(such as the Census) or specific (such as the National 
Health Interview Survey) scope.. ' / 

Recommendation . 1. UlHR -should develop ' a convincing , 
argument to 'Coagress and the. Administration * for 
implementation . of a uniform functional; def iriitipn ^ of 
disability and handicap, and the inclusiop of disability 
data -in census and other national, surveys.. 

In order to fulfill it-s ' mission as coordinator and 
initiator of research, that NIHR must develop the resources 
and staff capabilities to function as a core multipurpose 
analysis agency in this area. Examples of other core 
multipurpose .anal^^sis agencies in the federal government are 
the Economics Units in the Economics Statistics and 
Cooperative Service in" the Department of Agriculture; the 
Bureau of Economic Analyses in the Department -of Commerce; 
the Social Security Administration's Office of Research and 
Statistics; the Assistant Secretary for Planning and 
Evaluation, Department of Health and Human Services; and the. 
Federal Reserve, fioard. 

Each agency has a responsibility for data analysis and 
dissemination in a specific- area. NIHR should • assume a 
similar function as the authoritative voice in the area of 
.disability statistics,. 

Recommendation 2. NIHR should become a core 
multipurpose analysis agency concerned with disability 
statistics'. 

One of its most immediate tasks' as a core multipurpose 
analysis agency should be the establishment, or support, of 
the Disability Data Archiving ahd Analysis Center (DIDAAC)* 
We believe that NIHR's information needs — and indeed 
information heeds of most others concerned with- disability — 
would be better served through the establishment of this 
flexible system that will have broad capabilities to improve 
diverse, sources of data. The system should be a continuing 
facility, driven by user demands and responsive to the needs 
of many types of lisers. Such a system will enhance NIHR's 
role as the coordinator of research and will eventually lead 
to standardization of^ -data collection, analysis, and 
dissemination activities in the area. 
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Recommendation 3/ NIHR should establish or support the 
establishment of .a Disability Data Archiving and Analysis 
Center CDIDAAC). 

Before fully implementing DIDAAC, however, it would be 
advisable to .have a demonstration project to test the 
feasibility and cost-effectiveness of the^ facility* 
Experience gained from this project in the utilization of 
available data sources for making national projections would 
be most useful in determining the institutional and staffing 
.characteristics of DIDAAC. _ ' ^. . ^ • 

Recommendation 4. NIHR should' support^ a demonstration 
project to .test ^DIDAAC^ capability in making projections of 
national estimates firoin' existing, data' sources. 

.To. move NIHR" toward becoming a core multipurpose 
analysis ^agency it is essential to develop basic statistical 
policies and to improve its coordinator role vis-a-vis other 
statistical agencies, as discussed below-. 



Statistical Policies ' . .^r * 

In order to function as a core m.ult ipurpose analysis 
agency> NIHR must adopt some , statistical policies. These 
statistical . policies must address both good statistical 
practice and the xoordinative mandate given to NIHR in 
P.L. 95-602. The most basic statistical policies concern 
the coordination of; data collection efforts and a data 
archiving and access policy. 



Data Coordination Policy 

, The Office-of Federal Statistical Policy and Standards* 
Framework^' gives a detailed set of • goals tor the 
coordination of statistical programs. The increasing 
interaction, and in some cases conflict, between agency 
programs and goals makes it* advisable. to coordinate the 
collection and compilation of statistical data. Tighter 
budgets and- increasing respondent burdens make it necessary 
to provide for a maximum of power and comprehensiveness at 
the* same time as expense, duplication, and respondent burden 
are minimized. An NIHR data policy must begin with a review 
of * agency needs and a review of data presently being 
collected. The current project gives a comprehensive 
catalogue of* available data relevant to a single set of 



Office of Federal Statistical Policy and Standards. A 
Framework for Planning U.S. Federal ^ Statistics for the 
1980^s . , Washington, D.C.: U.S. Government Printing Office, 
1978. ' « • 
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demographi^p^uestions. We also outline the areas in which 
statistical data seem egregiously deficient. 

An NIHR data collection and statistical policy thus 
serves to define .a list of NIHR data needs and to reconcile 
that list of needs with a set of efforts to collect and 
compile needed data. 



A Data Archiving And Access Policy ^ 

^.While ^data collection and statistical policy is 
concer"ned with ends, a da^a archiving* and access policy is 
concerned with? fhe means -' to reaching, those, ends*. * The' 
primary purpose of a. data archiving -and' access policy is to 
trainsfprm machine-readable data files into bibliographic 
entities. At present, a large section of the economy is 
dedicated to the production, maintenance, and dissemination 
of printed materials. 'Most machine-readable data files are 
treated- as auxiliary, private notes to printed reports. 
Throughout the government and the academic community, t^ere 
has been a movement to treat machine-readable data files as 
reco^gnized bearers of information, much like publications* 
Data files are being provided wi.th a bibliographic identity: 
a titl^f author, producer, edition statement, etc. Once 
provided with a bibliographic ^ identity, f iles^are being 
.accessioned by -libraries * using specially \designed 
cataloguing rules much like those applica'^ble to 
publications. ^ . 

A' data archiving and accessing -policy \^states that 
machine-readable data files produced under the^ auspices of 
NIHR, its grantees^ or its contractors will be \assigned a 
bibliographic identity, will be preserved, aqd will be- 
distributed to NIHR or its designated archive*. \ Once so 
distributed, files will be available for analyses ^by others 
subject only to restrictions of confidentiality.. The 
promulgation of a data archiving and accessing policy will 
bring NIHR into conformity with a current major trend^rin the 
community of data producers and consumers. (See Appendix 
C.) . • 



Technical Standards For Machine- readable Data Files 

Underlying a data archiving and access policy should be 
a set of technical standards for machine-readable data. 
Such standards provide guidance to data producers concerning 
the exact requirements of the agency. While the archiving 
and access policy ^treats files as publications to be 
archived and disseminated, the technical standard treats 
files as collections of records with numeric * codes of 
various sorts^, and states such things 9s the kinds of coding 
schemes to be used, 'the way in which records are to be 
formatted, and other information which, if implemented 
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properly, keeps analysts and computerniks on reasonably 
friendly terms* 

jPechnical standards should be consonant with the . 
ral Informa^tion Processing Standards (EIPS) but should 
also address needs specific to NIHR* For example , the FIPS 
give, a set . of standard codes to' be used fo? states, 
counties, and otKer locations within the United States* 
NIHR technical standards should recommend the use of FIPS 
codes for such location variables. However, FIPS .has no 
standard- coding .systems. » for^ variables relevant* to 
^ disSifaility. .-Therefore, it^^might be ^appropriate for NIHKi 
.•^ technical standards t6 cit^ . lCD'-9, DSM-.IIIf %r " th^ World 
Health' Organization classification- for disability* 4 Thus, 
the technical standards for machiine-readable data files are* 
the lowest level of implementation of a statistical and data 
policy. (See Appendix A. ) >. , 

Coordination With Other Federal Statistical Agencies 

If NIHR is to become a general-purpose, analytical 
• agency, then it must be coordinated with other parts of the 
federal statistical system. How ,such coordination is to be 
established is an administrative matter for NIHR. However, 
some general areas of activity can be described. 



Standards And Practices . ' ' 

One of the best ways to develop a good statistical and 
data collection policy is to adopt -the best policies and 
practices of other agencies. The drastically low personnel 
level of the Office ot Federal Statistical ^Policy and 
Standards has led to the establishment of large numbers of 
interagency coordinating committees. Such an interagency 
committee could be formed to help NIHR with the formulation 
of a -statistical and data collection policy. Such a 
committee, composed of e'xperienced statistical specialists, 
could also advise on standards and practices to be 
established within NIHR subsidiary to the statistical, 
policy. The establishment of such practices would have the 
effect of bringing NIHR rnto compliance with the best 
practice, and of providing for commonality of standards and 
practices with other advanced statistical agencies. The 
* current Subcommittee on Demographics and Statistics of the 
Interagency Committee on Handicapped Research is the logical 
•basis for beginning such a liaison. 
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NIHR Primary Data 

The.\;^;NIHR statistical policy should specify a set of 
statistical series which can be obtained from other agencies 
or which can be generated in collaboration with • other 
agencie's. NIHR will then have to assume^ responsibility for 
.the .establishment .and collection of statistical series 
primary 'to its needs v^hich' are no£ avai.lable from others, 

Colleqial Relations With Statistical Agencies ' 

After the establishment =pf a statistical policy and the 
planning of its: own collection efforts, NIHR will have 
changed its relationship to other statistical agencies 'from* 
client to colleague. It is likely that advisory committees 
will :now be replaced by coordinating bodies of various 
sorts/ and ^that NIHR will participate' ifi helping other 
agencies to plan data collections of mutual interest. 



Joint Work On Data Collection 

NIHR, a^ a collegial agencyv will now be able to 
collaborate with, other agenc ies on definition and collection 
of statistical information. In its new position, NIHR may 
find itx easier to h61p service agencies to make their 
administrative data more useful to policy-makers and 
researchers. 
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V. INTRODUCTION TO DIDAAC 

rf NII^R is to become' a. multi-^purpose analysis agency; 
then it must have a data archiving and analysis capability*^ 
Such capability would reside in a disability ' data archive 
and analysis center (DIDAAC). In particular , the 
establishment of sucl? a, facility, is the best way to provide 
,b,ontinuing' ^nswer^ to questions regarding the demographics, 
»S€;rv»ices cpnsum^^' by^/ and n^eds bf the, disabled.. ^ . 

This section describes, in " detail functions an<^ 
capabilities of the proposed center and presents alternative 
Ways of- implementing it r including cost estimates for such a. 
center. In view of the financial constraints on NIHR, we do 
not propose that DIDAAC >e implemented immediately-.^" We 
recommend that the feasibility and cost-effectiveness of 
DIDAAC be tested in a demonstration project, designed to use 
existing data sources to provide national estimates' of 
incidence and prevalence rates of various. disabling 
coridit'ions. 



Characteristics And Functions Of DIDAAC 



DIDAAC could perform the following functions for NIHR: 

o • 

1. -Perform analyses on primary and secondary data files for 
NIHR acquiring or using its own archive files as 
necessary; 

/» 

2. Perform data analyses and respond to queries for other 
.public and private clients; * ' 

3. Advise NIHR on technical matters relevant to data 
collection, including contracting requirements; 

4. Promote the collection of disability data that provide 
information on. etiology well as functional 
limitations and activity restrictions; 

5. Provide NIHR with technical advice concerning the 
monitoring of. adherence to data and statistical 
standards, and' even tT3alTy"leaB^i^ less duplication of 
data collection efforts. 
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Internal Functions Of PI DA AC 

' , DIDAAC's activities can, be classified under nine 
headings: acquisition, cataloging., abstracting, reference, 
circulation, file processing,, analysis, and dissemination, 
and recommendations 'for Bata improvement. The first five of 
these f unctions ^.ar^ similar to those in a library of print 
materials ; V the ne.xt three ai^^ specific to^a^data library .and 
dipseiirtination system, and the. last ' tocu>vf?^^ needs' 
for improvement in t][ie ,s.tatistae'ai ^sf'^em. 5 * » 

Acquisition . The acquisition function i,s concerned 
with locating 'machine-ireadable data files ar/d their 
documentation. Some fileS, such as those maintained by RSA,^ 
can' be acquired on a* recurring and automatic basis. DIDAAC 
should be kept aware of NIHR grants and 'ontracts which will 
result in the production of machine-readable data files; 
Contractots and grantees producing such files should^bfe 
required to^ furnish copies to NIHR. DIDAAC can serve as an 
archive for such files and can supply contractors and 
grantees with technical assistcince , in meeting technical 
standards -for machine-rea'dable files submitted to NIHR. 
(See Appendix B. ) 

Those ^charged with the data acquisition function must 
maintain continuing contact with other government data 
producers aha with data archives. There should be a 
continuous exchange of information with other government 
data sources such as the National. Center for Health 
Statistics, the National Center .for Educational Statistics, 
etc. * " ' • • 

In addition to formal contacts with NIHR grantees and 
contractors and with other government agencies, ^DIDAAC 
should develop a network of informal contacts with data 
producers and users in government, private industry, and 
academia. It is a. safe gues^ that there will be occasions 
when such infor^nal expertise will provide materials which 
would be unavailable through "normal" channels. 

A part of the initial list for th'e^acquisi t ion program 
is to be found in our interviews with potential users, who 
suggested many useful machine-readable data files. Another 
^ starting point is the catalog of machine-readable data, files 
""compiled over the last six months. Beyond the catalog, 
which has concentrated on files which appear to be directly 
relevant to problems of the handicapped, DIDAAC should 
subscribe to the catalogs* of, all potentially relevant data 
collection and archiving organizations. Such catalogs 
include those periodica;lly .produced by the.. National 
Technical Information Service and the core multipurpose 
collection and analytical, agencies. In addition, DIDAAC 
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should have access to general population statistics. Such 
statistics can be used in estimation equations; they also 
provide the necessary background information essential to 
understanding the sigi^if icance of the disability phenomena. 

Cataloging . DIDAAC must maintain a catalog of machine- 
readable data files, ^documentation, and relevant 
publications. The catalog should be maintained as a "union 
list" of materials in the collection, materials obtainable 
from other sources, an3 materials which could be obtained if 
necessary. The current edition ^ of the Anglo-American 
Cataloging Rules (XaCR-II )^ ^ givefs extensive information on 
techaiques .f or ; cataloging rnachine-readabie data files i 
'while"' not in itself a library catalog, the inventory of 
machine-readable data files' compiled under this grant 
provides a set of information which can be used as the. basis 
for a catalog. ^ ^ 

Abstracting . A retrieval system for^ abstracts of data 
files and publications could be combined with the library 
catalog.. In most cases, the abstracting task will require 
far more substantive knowledge of the data and the area of 
investigation then will be available to the usual library 
cataloger. If DIDAAC is to support an abstracting function, 
it will require that disability specialists be included on 
the library staff. 

Circulation . The circulation function is concerned 
with inventory control and disclosure of information. The 
DIDAAC collection consists of an inventory of tapes and disk 
files. The circulation function is concerned with* knowing 
the location and current status of ^ such files, and' with 
supplying file information to library staff and users. 
DIDAAC would Tiot "lend" files in- the usual sense. Tapes and 
disk files would be made available to users of DIDAAC' s 
computing • facility-. Files for use at other computing 
installations would be copied to export tapes for shipment. 
Thus, at no time would a file ever be "on loan." 

An important task for those concerned with circulation 
js. to maintain conformity with various privacy and 
confidentiality standards^ If DIDAAC is to be a ''protected 
enclave," able to exchange confidential data with other 
government agfencies, then someone must protect confidential 
data from unauthorized public disclosure. 

Reference . The reference function mediates between the 
client and the DIDAAC collection. The reference function 
may bfr performed by a librarian, a substantive specialist, 

r 



American Library Association.^ Anglo-American Cataloging 
Rules. Chicago, IL: R. R." Donnelly & Sons Co., AACR-II, 3nd 
Ed, , 1978. * ' • 
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or a computing .or. statistical specialist. At the outset the 
reference function would probably be performed by those 
"involved in data acquisition. 

The reference function encompasses most of what are 
also called ''user services" and "marketing," " New clients 
will deal, with Reference, who will introduce them to the 
other functional group§^ within DIDAAC as n.ecessary. Ih many 
; — ^cases.,^br4ef— requests^ w.i.ll~pr.obably-JDe^andled~4>y — Reference 
without the need f or\;ief erral to other DIDAAC groups. 

^; ' File processing ... File processing is almost exclusively 

the .province of completing and .statistical specialists. File 
processing, is for* two main purposes: to bring .the file to a 
* desired state of readiness; iand to prepare it for -a specif ic 
analysis. Initial processing includes the preparation of^ a 
machine-readable dictionary and user's guide, and checking 
the file's completeness,- consistency > and accuracy* (See 
Appendix D.) In some , cases, a set of classification 
variables may be created before the file is made available 
for use. Initial preparation may also include the deletion 
or encryption of personal identi fiers. 

Many users will require the creation of special 
' variables or indices, the aggregation of microdata files, or 
the merginci or linking of sets of files. In some cases, the 
files , thus . roduc'^d may be :^uitable for inclusion in the 
permanent collection. . 

Analysis , DIDAAC ' staff. *may include statistical 
specialists and people with substantive expertise in areas 
of disability. It would then have the capability of 
providing statistical consultation to clients and of writing 
research reports in response— to queries. 

Dissemination . Regular reports could he produced to 
present results of DIDAAC efforts and findings and to 
suggest future directions of activity for data collection 
agencies. DIDAAC s purpose , and availability* would be 
widely announced to facilitate use of the system. These 
functions would be geared toward enhancing NIHR's role as a 
coordinator in the area of disability leading to a more 
^ integrated data collection effort among all agencies. 

Data improvement . It is reasonable to assume that 
DIDAAC- will "he Tnitrumental in improving the quality< of 
. disability statistics . Its file processing activities 
should eventually lead to accumulation of highly comparable 
data files on disabilities. Through its statistical 
consulting and technical dissemination activities , 'DIDAAC 
should'encourage changes in various collection efforts which 
would improve the comparability of data. DIDAAC could 
demonstrate and . promote the necessity of collecting 
information on the etiology of a conditioi: tiS well as^ its 
consequences in terms of functional limitations and activity 
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restrictions*. Such information would, greatly improve our 
ability to more accurately relate disability to its medical, 
and behavioral antecedents. 



User Requests ' Arid DIDAAC Services 

The following ,are some of the ^types of requests which 
.might-be-made —of — DIDAAC^.and examples^ of the types of 
s,ervices needed to fulfill such requests. {NeiJ:her the list 
of requests nor of services is' meant to be exhaustive. ) 

\* * "Hot ' liri'e" requests • for : a* fact. , Many ^requests to 
\inf ormat ion providers are f.or a fact, i .e. , . something^ which 
c^n be expressed- in a single sentence. The classic 'example, 
of^.such a request is '*How many disabled people are there in 
the\ U. S.?". In some instances^ the person responding to 
£he \equest may simply know the answer, or may be able to 
look i.t up on the spot. * A rough definition for a "hot line" 
request might be that it can be answered, in less than one 
'hour andl^ that the answer can be transmitted in a normal 
telephone, conversation. 

Requests for a bibliographic reference. DIDAAC will- 
not ordinarily serve as a bibliographic reference service, 
since such functions are already well, .performed by a variety- 
of services. \ A client with;only a bibliographic , request 
--WouTd probably\be referred to the appropriate agency. A 
.^bibliographic request arising; in the course of ' a client's 
continuing relaSivpn with DIDAAC would probably be handled by 
'DIDAAC staf fmembervs as a courtesy to the clients 



DIDAAC itself wpuld probably be both a client and a 
provider of materia^ls to bibliographic reference services.. 
Much data documentation and most machine^-readable data files 
have not been properly cataloged and abstracted. DIDAAC 
would make ■ its own catalog records and abstracts available 
to bibliographic organizations.. 

Requests for publications . DIDAAC will serve as a 
primary source for its owh analytical publications and for 
documentation for machine-Veadable data files in its 
collection. Recurring requests for the same information can 
be satisfied by ^ the publication of .a series of reports on 
the demographics and epidemiolc^gy of disability. 



Requests for anklysis > 
pass requests for analysis 



The 



\ 



reference facility will 
on\ to the analysis facility, 
which will be responsible for devel^oping cost estimates^ and 
performing the workl Analysis can be orders of .magnitude 
.more costly than the provision. of facts, publications, or 
bibliographic references. The latter products and services 
may be provided to a clientele without charge, but the 
former may entail sufficient expense as to make necessary a 
fiscal justification. In some cases, tlie client may pay .the 
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expense,s of the analysis; in othe!: cases, internal funds may 
be used. In either " case, there must be some budgeting 
mechanism, so that an. analysis task, does not 'consume 
unwarranted resources. 

. Requests for a machine- readable data file . DIDAAC 
should provide files and documentation to clients who prefer 
to use their own computing facilities. In some- cases, 
'DIDAAC may acquire and prepare files for a client, who then 
has the files ^ analyzed at another computing facility. In 
such a case, a copy, of .the file and documentation would ^ be 
added to the DIDAAC collection. 

Requests for ' special reports . DIDAAC personnel with 
substantive expertise in areas of disability can write 
./Reports for clients, but such aictivities must be. budgeted 
and approved on 'an individual basis. 



Functional Components Of DIDAAC 

A full specification for a disability data archive and 
analysis . center (DIDAAC) center requires a specification of 
hardware, \ software, data, procedures, and people. . The 
specifications given, her.e are only estimates, but can serve 
as a guideline for evaluating proposals. ' | 

Hardware . Primary hardware of DIDAAC is a medium or 
large scale, computer housed in a compu^;ing center with 
extensive data storage facilities.. An eqiially important 
piece of "hardware" is a li bra ry-and' cataloging faciiity^foc 
hard, copy of doc.umeniatiolfi and publication , \^nd microform. 
It is not necessary for the facility to nave its own 
computer. The amount' of computing that an\y conceivable 
facility would do is easily within the capability of .any 
large research-oriented, computing center. The economies of 
scale incurred by using a large computer especia]\ly one in a 
'center with facilities for tape librarianship, far outweigh 
any possible' advantages of possession of the machine. 

Reasonable candidates for, a DIDAAC computer center 
include the . National Institutes of Health computing 
facility, the- University of Michigan or Wayne St^te 
University Computing (^jsnters, or any of the local Washing;t6n 
Computing Centers such as American Management Systems, 

COMSHARE,* etc. ' ' ^ 

M [ ' ' ' 

^ Sof tware . .A major„ factor in , the choiice of the 

computing center is the range . o£ software available. 
Presently available software- is fully adequate to do the 
computing, bibliographic, and publication needs ,6f • DIDAAC. 
There is no need for the* writing of any appreciable amount 
of software, nor should DIDAAC be required to import and 
maintain its own programs.. However, DIDAAC can^ be expected 
to make heavy Use of software for file manipulation, 
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statistical analysis,, graphics, information retrieval, ^ 
document processing, and communications • 

File Rrqcessing in statistical spftware are often 
combined in ^ these same package. File processing' programs 
are' used for aggregation, merging, *and the creation of new 
variables and - indices. Statistical programs are used for 
the analysis of files createcl by the file processing 
programs. The. most popular statistical and file processing 
package is SPSS/ a well documen^ted package but one with 
serious shortcomings as a file management package. More 
powerful packages, ' for , file ^ management include the 
Statistical Analysis System ,(SAS)\ and OSIRIS IV. SAS ,has 'a 
powerful, set' ;6f/ analytic "routines, and is' growing in 
.popularity. OSIRIS, IV ^has as its major , advantage the 
ability to handle multilevel, structured, • and hierarchical 
files;' . \ . * . - 

Some DIDAAC * users may prefer to do their .own 
statistical work using an interacti ve-- statistical system. ^ 
statistical systems with interactive capabilities include P- 
Stat, .the SPSS Conversational Statistical System XSCSS), and ^ 
the Michigan Interactive Data Analysis System (MIDAS). ^ At 
least one interactive statistical system should be provided 
for the use ^of . DIDAAC clients. , 

An 'increasing number of statistical results are being 
disseminated as graphics rather thafi numeric ^tables. Some 
statistical packages such "as SAS and SPSS have integral 
graphics routines, .and offer a choice of graphic .and tabular 
output formats. ^S.uch facilities should be offered to DIDAAC 
clients eithet,, as hands-on option or as outputs generated by 
the technical 'staff. • 

Graphic. X output requires a suitable output device. 
DIDAAG' users should have access to a variety of high and low 
resolution plotters. Inexpensive low resolution devices can: 
be 'purchased by DIDAAC' 'for remote connection to the 
computer.* Expensive^ .high resolution devices should be 
provided by the computing facility on a chargeable basis. 

If DIDAAC' is to have a statistical publications 
mission, then it should employ the computer as a publishing 
mechanism., Computer based text processing facilities allow 
the incorporation of numeric and graphic output into 
publication without the need for transcription. The 
computing facility should have available multi-font devices 
like the Xerox 9700 Page Printer and a phototypesetter . 

' DIDAAC differs , from a contract research' computing 
service .in maintaining a continuing .collection of data and 
documents. Such a collection becomes useful only with "a 
proper catalog, a catalog Vhich is best maiatained as a data 
ba'se. DIDAAC should be able to make its catalog directly 
available on-line to its. users as well as to its staff. The 
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information retriev^al software\ for - doing so should be 
provided by the computing center / while the file of datalog 
information will be provided by DI^DAAC. 

An electronic message system and an •on-line 
conferencing system nave prov.ed valuable for facilities, such 
as DIDAAC and should be made availatile to* the facility and 
"its clients. Ah * electronic message system - serves as a 
valuable way^ fpT^clients .to -traJ^snf^^J vork orders.,, and to 
seek and receive aid. On-line conference programs -provide a 
medium for on-goijng technical and administrative, seminars. 

Data . The- /core "Jof DIDAAC is- its data archive. A 
continuing program of data .acquisitioH,, should result .in a. 
definitive archive oiE data, onf all aspects of. disabilrty. 
The growth of a linkage between NJ^HR an'dl , other statistical- 
agencies should result in 'regular procedures tor the 
acquisition pf government data.^ la addition, DIDAAC * should 
serve as NIHR*s official repository for -data gathered under 
NIHR grants and contracts. 

. Data, analysis projects conducted within DIDAAC should 
differ from those conducted in the usual course of research 
contract by, being oriented toward "improving the 
collection." Since the DIDAAC archive will be a continuing 
facility^ it wilL be worth a .c:ontinuing investment in .the 
upgrading of tne> files in:'the collection, 'thus, .a client 
request for a specific file creation job may in fact result 
in the^ creation of a more general file. The investment in 
the more expensive process of creating the general f*le is 
justified .because the file contributes both*to the cl^ nt's 
needs and to the qualityo'f ^he collection. 

Whi/le machine-readab^e data files are the raison 
d' etre , ' few. if any files will be maintained in the same 
location as the DIDAAC staff. Data tapes will be maintained 
at the computing facility under the custodianship of the 
computing-facilities tape librarian. The responsibility ior 
back-ups, tape maintenance^, and^ vault * storage of 
irreplaceable data will remain with DIDAAC. Arrangements 
for the- care of files containing, personal identifiers or 
dther confidential information must be worked out between 
DIDAAC and the computing facility. It is not likely that' 
such files will amount to any significant proportion of the 
PJDAAC archive rior that ^any extraordinary procedures will. be' 
.required to handle such files. ; 
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^ ' Procedures 

. . The operating procedures of DIDAAC will^ resemble those 
of libraries, computihg centers, and research s.ervices. ^ 
Procedures will, of coursjB, change with experience.. It is" 
possible at the outset to specify some broad- clasises of 
procedures which must be\established. 

. L ■ ' ^ \ . \ 

' User services . DIDAAG is primarily driven by the needs 
of its users. Accordfngly, procedures must be estabii\shed 
for determining what services are available, to whom they 
are a\^ailable, and which DIDAAC staff will be made available 
to handle particular kinds: of .requests. It may be» necessary 

. to establish a priority system for classifying customers and 
one for classifying the sorts of work to be done. Some 
sorts of analyses or data may Jbe classified as beyond 

. DIDAAC':S scope of work. (The problems of high priority 
clients with low priority tasks or gt low priority clieiits 
with high priority tasks are too baroque 'even to 
contemplate.) At the outset, the technical staff should be 
responsible for customer relations, once a. user has been 
accepted. If there is significant growth in the use of 
DIDAAC, then it" may be necessary to have one or more persons 
who serve as a gatekeeper, allocating technical and staff 
resources to users. 



Costs . There has been some controversy over the degree 
to which DIDAAC should function on a cost recovery basis. 
At one ey-treme is the . model of the contract research 
organization which recovers all operating expenses from 
users. At \ the other^ extreme is the model of the public or 
university library (or the Congressional Research Service) 
which offers "unlimited" services to its user community. 
DIDAAC has. the potential f or^, incurring^^ an . extraordinarily^ 
wide range of costs. Some requests may be answered in the 
coOrse of a one minute telephone call, \ while others may 
require several person months o,f effort and large amounts of 
computing.^ \^ 

The most satisfactory way to allocate resources amoiig 
clients is by the use of an internal 'dollar-cost system. 
All expenditures, whether in personnel time or other 
resources, should be expressed in-dblTars and charged to a 
budget. The degree to which the internal cdfet structure is 
presented to users is a decision for DIDAAC manaigement and 
NIHR. In som^ cases.,, the user may_be ent/irely ignorant of 
the internal cost function. For example, what is advertised 
♦.externally as^a "free" public inquiry service could ,be but 
*a:counted ffcr internally as a set of $20 budgets available 
1 3' public ytisers . Any person with a casual inquiry coUld be 
serviced/ up to the $20 limit, could be asked to provide 
justification for a higher Expenditure, or could be askjsd to 
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pay costs beyond public, us^e budget.. Similar budgets could 
be set up . for other classes of users and may be made 
^mani-fest to them as required. 

Experience has shown that such an internal dollar cost 
system is the only rational way of allocating resources 

among a multitude of users. / 

I . ■ — 

Marketing . It is in. the interest of both DIDAAC and 
NIHR. that It serve as wide a constituency as possible. 
Broad use of the facility will tend to- encourage a flow - of 
statistical data and ideas' about disability, to improve the 
quality of data, and to improve the quality of DIDAAC' s 
collection../ The marketing function should concentrate on 
identifying potential users of statistical data and 
analyses.. Accordingly, ^cooperative arrangements should -be 
^-^ade with other data producers and with- coordinate agencies 

such^ -as- ^the^-c^Nat ional /Rehabilitation Information Center 

(NARIC) . Such cooperative marketing arrangements will allow 
.DIDAAC* to maximize its potential users base while 
, maintaining friendly and cooperative relations with other 
suppliers of statistical and bibliographic -data. 

One of DIDAAC s most important publications will be 
it's list of new acquisitions of data and documentation. 
Such a list not only informs potential users of new DIDAAC 
capabilities, but also serves- to inform them of research and 
statistical work -in the field of disability. 

Publication . \ One of the main purposes of DIDAAC will 
be as a center for reliable statistical information on the', 
'disabled. While most DIDAAC reports will be generated in 
response to specific [user querie^, it is expected that a 
core set of ,disa^bi|li ty statistics will be of general 
interest. Requests\ fjor suph "best selling" information can 
best be filled by a /regular program of publication. 
Although such publi^cat ions will be used to fill large 
numbers of. user rejques'ts, the analyses supporting the 
publication must be '(scheduled^ by the DIDAAC management. 
Thus, a schedule of\ recurring publications must.be matched 
by a schedule of reciiVring analyses of data to support the 
publication. " \ 

:' \ 

Data acquisition s DIDAAC differs from a contract 
research computing center in that it has a continuing 
Responsibility to maintain and improve its collection. 
Accordingly, significa'nt attention must be given co 
maintaining a liaison with other producers of statistical 
data. NIHR should require all contractors and grantee's to 
supply DIDAAC with| copies of machine-readable data files 
prpduced in. the courjse of research. 

Data quality co'ntrbl and maintenance . DIDAAC must have 
internal quality cdYitrol function so as to inform users of 
the quality of data and analyses. If DIDAAC is to serve as 
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a repository of data produced under NIHR grants and 
contracts, then it ^:an serve to monitor the quality of such 
data. NIHR ml^ght ^ included as a special condition ot 
research agreements that machine-readable data files must be 
submitted and must pass certain quality control checks. 
(Such data checking will .require more than trivial amounts 
of resour.ces, but will result in better research results.) 
NIHR efforts to improve the quality of data can only serve * 
to ameliorate the quality pf statistical data throughout 
government. 

. In addition to data quality control DIDAAC must have a 
data maintenance program. It has become apparent that 
machine-readable storage media such as magnetic tape are 
susceptible to deterioration over time, especially if" they 
remain unused. DIDAAC must have a policy of tape 
inspections and back-ups, including secure vault storage of 
irreplaceable statistical data. 

Accounting . DTDAAC must have facilities fpr the 
accounting of cost, effort, and results. The internal cost 
accounting system is" a necessity for balancing the competing 
needs of users. A system for accounting of productivity^ is 
necessary to determine the extent to which DIDAAC is a 
successful investments. 



Administrative Organization Of DIDAAC 

DIDAAC's administration divides into two major areas, 
the technical administration of data and ."the use of those 
data to produce knowledge about disabilities^. These two 
major functions could be further divide^ into data 
\archiving, data operation, statistical consulting, and 
client services. These four groups are discus^sed in more 
detail below. • \ 



Management . Initially the management functions of 
DIDAAC can be performed as an additional duty by one of^ the 
technical staff. As the organization grows, a specialized 
management staff can be developed. ^ \ 

Staff . The DIDAAC staff t^ust have a knowledge of the 
disability area and disability statistics, as well as. 
general knowledge of computers and programming. The number 
of people to be employed and the division of labor is a^ 
function of availability of funds and DIDAAC* s potential 
market. It could usefully employ people with background in 
computation, librarian ship, statistics, and disability. In 
some cases, DIDAAC could function- with the help of 
specialists called in on a consulting basis. The next 
section gives a more detailed description of an 
administrative organization for DIDAAC and potential levels 
of staffing. j 
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Data Archive ^ The actual maintenance of data tapes 
will be done by the computing center. DIDAAC's own data 
archiving operation will consist of accessioning tapes, 
constructing catalog records, and pursuing a schedule of 
back-ups and file saving operations . Data archiving 
functions are best handled" jointly by a computing specialist 
and an librarian. A person of either specialty can be 
designated as DiDAAC's official data archivist. 

Data operations . Data operations are the core of 
DIDAAC. The actual operation of the computer is left to the 
computing center. DIDAAC* s data work should be performed by 
technical people' with Bachelor's or Master's level training 
in computer science and in the social sciences. As 
technical' people gain proficiency, they wMl be able to 
perform both as computer people and as substantive 
specialists in disability statistics. Since^clients with 
complex jobs will usually have their own expertise in 
.disability, it can be expected that growth in DIDAAC staff 
will occur primarily among its data operations people. 

Research and client services. The research and client 
services group is the repository of DIDAAC' s substantive 
expertise in disability and disability statistics. The 
group serves as a liaison between clients and the computer 
people in data operations and as a client of data operations 
for the production of a DIDAAC research reports. Necessary 
skills for personnel in client services and research include 
the social ' sciences, public health, disability, 
rehabilitation, statistics, and librarianship. 



Implementation Of DIDAAC 

A data archive and analysis center could be established 
in several/ ways. These include: 

1. A facility established within NIHR, using government 
personnel and government or contract computing 
facilities;^ 

2. A. facility or activity within another federal agency 
supported by an interagency agreement with NIHR; 

3. A new center within the Federal government, but outside 
of NIHR; 

4. A center within a private facility supported by a 
federal grant or contract. 

DIDAAC s- responsibilities will involve it with both the 
supply ^of and the ^demand .^or statistical data. DIDAAC' s 
staff; members \ will be involved with other government 
statistical agencies and with researchers and contractors 
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^wlvile locating and accessioning data files. They will also 
be concerned' with the longer range matters of setting 
requirements for data and setting and enforcing data 
standards. On the d emand side, DIDAAC's staff members will 
be involved with NIHR and other executive branch cTients, as 
well as congressiohal, .state and local government, and 
. private-sector clients. 

One of DI'DAAC's most important characteristics is its 
continuity. The facility must be given time^ to establish a 
base of expertise and a data collection. Without a 
• continuing basis of support, it will offer little that an 
expert researcher or contractor could not provide for 
themselves • 



Criteri a 

The alternative implementations for DIDAAC can be 
evaluated using criteria of staff expertise, excellence of 
collection, continuity of existence, jj .independence, 
timeliness of resppnse, and cost. 

Staff expertise , DIDAAC must be able to field an 
'expert senior staff and to maintain a reputation for 
expertise. This criterion is more than the fatuous remark 
that people should be good at their jobs. DIDAAC^^*must have 
access to recognized experts in disability and'^ disability 
statistics, either as staff members or consultants. Lack of 
such of demonstrable expertise may not adversely affect the 
quality of DIDAAG's work, but it will decrease its 
credibility. 

Excellence of .the collection . DIDAAC must maintain a 
definitive and comprehensive collection of files on 
disabilities. The requirement for a collect ion of 
demonstrated excellence is exactly analogous to the 
requirement for staff expertise. 

Continuity of exi stence . It may be possible to begin 
operations with a staff of wide reputation and .undisputed 
excellence. However, the development of a full staff will 
probably take some time. The development of a definitive 
data collection will require considerable time under any 
circumstances. Once DIDAAC begins operations, it will 
require some time to develop a reputation for institutional 
excellence and to become generally* known. All of these 
processes will be facilitated by a continuity of support and 
of institutional identity. Changes of name, institutional 
affiliation, or an excessively high turnover rate of senior 
staff will tend to work against an impression of competence. 
Interruptions in funding will work against the reality as 
well as the impression of competence. 
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Independence , DIDAAC must be independent from the 
excessive influence of program specifi/c statistical 
agencies. It must also be perceived as independent from 
such influence. Such- independence can be /facilitated in 
- severa l: — ways. &stab l -i-shmeat ^af D.LDAAC.Xn an independent 



institution can insulate it from, undue in/flyence by its- 
clients. The development of a wide client /base can provide 
some stability when some costs are recovered . from n'on-NIHR 
clients. However, it is unlikely that/ the majority of 
DIDAAC's costs will be reimbursed by non-NIHR clients. 



Probably the best way to safeguard DIDAAC' s 
independence is through the reputations or its senior staff 
members. People with established reputations for 
professional integrity are unlikely to exhabit bias. 

Timeliness of respdhse . While DI^DAAC may perform a 
number of recurring or long-range studies/ most of its work 
will probably be directed to satisfying one-time requests of 
short to moderate duration. DIDAAC should have sufficient 
staff on hand to respond to high priority requests without 
undu^ turbulence in scheduling and stafyfing. It should also 
have adequate communication facilities/ including sufficient 
telephone lines, terminals, etc. / 

Cost. Although a truism, it should be said that DIDAAC 
should operate as efficiently and inexpensively as possible. j 

Costs For Several Levels Of Staffing 

It is beyond our expertise to prepare a budget for a 
government facility. Table 3 shows some possible levels of 
staffing, ranging from a minimal staff to a very large 
staff. Figure 1 shows estimated costs for each level of 
staffing. The salary, expense, • and indirect cost 
multipliers used are BSSR's, but are representative of what 
most contractors will charge. 
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Table 3: Examples of DIDAAC Staffing Levels 
(Full Time Equivalents) 



• 


Minimum 
\ (A) 


(B) 


(C) 


(D) 


(E) 


(F) 


Max 1 mum 
(G) 


% 

.facility manager 










i d 5 u 


1 • u 


1— n. 


Senior Researcher 




0 . 5 


U • D 


1 • U 


1 . u 


1 • D 


< ; U 


Junior Researcher 


\ -1- • u 




1 • u 


1 • u 


z • u 


z • u 


z • u 


- Senior programmer 




1.0_ 




1.0 


1.0 


1.5 


2.0 


Junior Programmer 


0.5 


6.5 ■ 


1.0 


0.5 


1.0 


1.5' 


2.0 


Librarian 








0.5 


1.0 


1.0 


1.0 


Secretary 


0.5 ■ 


1,0 


1.0 


1.0 


1.0 


2.0 


2.0 


Total 


3:0 


4.0 


4.5 


6.0 


8.0 


10.-5' 


12.0 



Table 4 shows the relation between the alternative 
institutional arrangements and the criteria. 



Conclusion 

There^ is presently a ^ serious disparity between the 
chaotic state of disability statistics and the order and 
serviceability implied in the call for' a comprehensive data 
system on the disabled population. Even if the Post-Census 
Disability Survey were completed, and we recommend that it 
be completed, the're would still- remain the problem of 
coordinating .the collection, analysis, and dissemination of 
disability statistical data. Congress has given the 
Director of NIHR the mandate to produce better statistical 
data through copperation with major data collection 
agencies. We view the step-by-step development of the 
Disability Data Archive and Analysis Center (DIDAAC) as the 
bestPmeans of addressing* the need for better data through 
coordination. 

Initially, DIDAAC would concentrate on the improvement 
of disability data through the archiving, analysis, and 
dissemination activities described above. As time goes on 
and the expertise of the DIDAAC staff and its overseers 
(e.g., the Subcommittee on Demographics and Statistics of 
the Interagency Committee on Handicapped Research, and the 
National Council on the Handicapped) strengthened, DIDAAC s 
function would be increasingly to help the Director of NIHR 
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Figure 1: Annual Cost of DIDAAC 
at Seven Levels of Staffin^g 
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Tnble 4; Alternative implementations of DIDAAC 

In Other Federal Agency In New Federal Agency 



Contractor 



Staff expertise 



No NIHR 5taff available 
at present 

Staff would be entirely 
under NIHR's control • " 

New staffing not likely 



Other agencies have good 
people with strong 
reputations. 

Other ^agencies are not 
likely to have all 
necessary skills. 



Same staffing problem as 
at NIHR. 



Sk i^ 1 s-,ane-^ay.a 11 able_ and^ 
canybe obtained by 
''Subcontracting or 
consortia... 



Collection 



•NIHR the owner and 
permanent home of the 
collection. 

No available staff with 
acquisition and 
maintenance skills* 



Agency would be a 
custodian, not an vwner. 

Agency would have 
necessary maintenance 
skills (or else no agree- 
ment.) ^ 



Agency would be an owner,, 
not a custodian. 

Would have the requisite 
skills for maintenance. 



Contractor would be a 
custodian, not an owner. 

NIHR would "need 
repossession arrangement 
if contract terminated* 

Would have maintenance 
skills (or no contract.) 



Continuity 



By definition, would 
exist as long as NIHR, 



Subject to NIHR funding 
and willingness of other 
agency. 



Contingent on funding of 
contract. 



Independence 



Dependent on staff 
reputation and perfor- 
mance. 



Dependent on staff 
reputation and perfor- 
mance . 



Dependent on staff 
reputation and perfor- 
mance. 



Dependent on staff 
reputation and perfor- 
mance. 



Responsiveness 



High incentive to be 
responsive. 

Subject to staffing 
limitations. 



Subject to conflicts with 
internal agency require- 
ments •and staff 
limitations. 



Subject to competing 
requests of other 
clients. 

Moderate intention to be 
responsive 



High intention to be 
responsive. 

Possibility of competing 
requests from other 
clients. 
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to evolve statistical policies that will improve the quality 
and", comparability of disability ' statistics throughout the 
federal government. 

We have outlined here the technical and policy steps 
needed to establish DIDAAC. The initial phase of-^' a 
j3jeLmojisJtxa.ti on grant shoulia be designed to address the 
immediate need for good s,tatistics, and the Ibnger-term need 
of coordinating federal disability^ statistical efforts. 
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• This research memorandum describes general technical 
considerations that are essential for evaluating . the 
internal consistency and comparability of any data files to 
be included in a comprehensive data system,^ The 
specifications in this document fit within the broader*, more 
content-bound and project-specific set of requi remerj ts . 

The acquisition of data begins with^ a search for 
available, data files or with a client's request for a 
specific file. In the latter case, there is prima facie > 
reason for considering the data f ile ' relevant to 'the 
purposes of the comprehensive data system. Files obtained 
as a consequence of a search by the staf^f of the , data system 
must be evaluated on substantive grounds for their relevance 
^nd adequacy. Once a substantive evaluation has resulted in 
a decision to acquire, a data file of unknown quality or of 
doubtful provenance must be subjected to an assay to ^ 
determine its completeness and consistency. This report 
describes the rationale and methods of a data assay. 



The Data Assay / 

^ . / 

The technical evaruation of a set of data is called a 
data assay. A data assay serves four major purpose^: 

9 

1, To assess our confidence in the quality of the ^data; 

2. To investigate the costs and benefits of correcting 
errors in the data; ' / 

/ - ' 

• 3. To provide guidelines as to which correq^^ions clre 
feasi-ble and which are not; !r 

I 

4. To provide error estimates for .data which cannot be 
corrected. 

The assay has the following institutional uses: 
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1. It serves &s a guide to data editors and those who ^ill 
* be c'orrecting the data; 

2. It .provides secondary analysts with a statement of the . 
data's qualitiy; 

3* It -provides valuable feedback to data producers on the 
quality and stylistics of their product; 

4* It , provides -an overall evaluation of the quality of a 
research or data production project. 

The assay of. machine-readable data files(MRDF) is 
primarily formal; and is affected very little by the file's 
substantive content* Part of the evaluation of a data file 
concerns its content and relevance. Such evaluation, while 
most important, is not wi-lzhin the scope of this discussion. 
The assay of machine-readable data can be^ classi f ied under 
the linguistic headings of syntactics, semantics, and 
pragmatics. 

The syntax of data is its formal structure without 
reference ,to its message." An example of a syntctttic 
characteristic of the data set ,wouldc be that the variable 
SEX is in position 91 of each record and is^ represented as 
an IBM-format numeric character. 

The ^semantics of a data set is the message or 
information it carries. Some semantic characteristics of a, 
da.ta set might be that.it has a record for each of 35,000 
hospitalized patients, and that the sex of patient 10954 is 
male. 

Th e prag matics, of a data file is the mapping of 
syntactic elements onto semantic ones. An example of a 
pr-agmatic characteristic of the data set would be that sex 
is represented by a "1" for male', "2" for female, and "3". 
for missing data. 



File-level ' Analysis 

A data set in machine-readable form is usually, referred 
to as a "file." h file, is distinguished with a name and 
other information which, allows it to be located , and 
processed with the computer. Even thou'gh^ pragmatics are at 
the lowest level of the hierarchy files cannot be e'Valuated 
until some pragmatic criteria have been met. ' Primarily, 
tapes must be readable. WhiJLe the IBM 360/370 hardware has 
provid^ a de facto standard £or magnetic tape formats, some 
old liWpes may be written in formats which require explicit 
conversion. A conversion may be complicated by the fact 
thatVit -is sometimes not clear what the conversion's- target 
shouM look like. When there are only summary statistics 
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for an unreadable file of microdata, it may be difficult (or 
impossible) to tell . whether the file has been properly 
deciphered. 

Some tapes/\whxle wri"tten in readable format, are so 
old or so poor in" quality of recording medium as to make 
reading difficult or impossible. In some cases, it may be 
desirable to attempt to reconstruct the data, ' either from 
the tape as ^ provided, or with the help of the file's 
producer. 

Structure error . The higHes.t level of error is^ what 
can .be called a file structuicie error, in which the file is 
not as described. In some' cases, a structure error is the 
result of having been given the \wrong file. At the 
beginning of an assay we note the recotd length, the number 
cof records in the file, and the content of fields listed as 
record identification in the file description. A file with 
too few or tob . many records is suspicious; a file with 
records that are too long or too short is unsuitable .for 
further analysis until an explanation or new documentation 
has been obtained. 

\ / 

Complex Files . A "complex" file' is one that contains 
more than .one -type of record. A complex* f i le , for example, 
might contain one record for each patient in a hospital and 
one record for each prescSript ion written for a patient. 
Complex files may incur structure errors at the level of the 
file or at the level of the 'individual record type. Most 
complex files-' are^ structured, having a logical relation 
,befv/een records of 'different types;^ 

ft ^ , 

One common^ error in hierarchical files is that the 
number of records at a lower level does not match the count 
that is contained as data in a higher level. While such 
errors may not be as. severe as having a file with missing or 
extra records, such disparities cast doubts on the file's 
quality. , . \ 



^ It should be noted that not all complex files need be 
structured and not all structured files need be 
hierarchical. While somewhat bizarre, it would be possible 
to combine the records of. two unrelated files into a single 
structured file. A file is structured when' it includes a 
logical* description of the relation between record type.s. A 
f ile . of data^ from a longitudinal study could consist of 
parallel records from different observation periods, but 
wit^ :no hierarchical relation. 
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Consideration in the Evaluation 
Data Files 



Record" And Variable-level Analysis 



, The first detailed' analj^sis of a. set of data is 
syntactic. Before a-ny analysis of semantics, is feasible, it 



is necessary to "deteVmTne 'whether the data file conforms 

The possible err^ors in a data set 



its description, 
'into several levels 



raai 



Syntactic Er'ror's 

The evaluation :Of a set of data includes a comolete 
analysis of the syntactic characteri^stics of every variable. 
A variable' s "syntactie range" is the set of representations 
a. variable may take. Sor example, a variable called NAME 
might be restricted | to upper case rjrinting characters ,, with 
•a- -maximum. le-ngth .of 20.; A variable called AGE may haye its 



as 3 numeric/characters 



JOKES" is 
nolrr 
but 



with leading 
within the syntactic 
- Similarly, "045" is 
"TEN" and " 4;5" 



are 



in the syntactfc range oi 



syntactic range given 
zeros.. Accordingly, "FRED 
range of NAME, but j'Fred Jones" is 
within the" syntactic, range of AGE, 
hot, (Blanks are] not includecj 

AGE.) Violations of i the syntactic range are serious errors, 
indicating ootentially grievous fjlaws in the gualit/y of tne 
data. Syhtactic errors are almost! always at the level of 
the individual variable or data item. Syntactic . errors are 
usually fairly easylto discover. Some syntactic errors are 
primarily stylistic, as when a fiie contains leading blanks 
where leading zeros Ihave been prescribed. In such cases, 
the repair or \ accommodation \ to such errors ' is. 
straightforward. 
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Seman\t-ic Errors j 

- \ \ . . 

A variable's "semantic range" is the 
may' validly convey. (While the syntactic range :of 
from '"0"to"9", its semantic range may W restricted 
-and- 9 indicating "male," "female," hnd "missing 
Semantic errors are| often described \as "wild" or 
range" values. i \ 



messages 
of SEX 



it 
is 
to 0, 1 
data." 
"out of 



Joint Semantic Ranges . Semantic ranges apply to sets 
of variables as well as to individulal va;riables. The 
investigation of semantic ranges of sets\ of variables is 
. usually- --called "consistency checking." Ror example> "Male 
\ is^ in the semantic range of SEX and "CaesaVian section" is 
in the semantic range of PROCEDURE. Huwever, the ^omt 
range of SEX and PRd,CEDURE does not idclude these two 
Values. The concept may/ be extended to any\se,t of variables 
and is exactly parallel to the construction of ^omt 
probability spaces. 
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Statistical Semantic Ranges > A statistic is a variable 
which describes a set of values foi^' another variable. What 
is commonly called the range of a statistic is here called 
the syntactic range. Where there is knowledge of a set of 
data, it is possible for a statistic to have a semantic as 
well as a syntactic range. For example/ a proportion has a 
syntactic range f-rom 0 to 1. However, if we are excimining a 
set of data from obstetrical pa t ients , • then any proportion 
of males above 0. is outside of the semantic range for this 
statistic. The " investigation of semantic ranges of 
statistics requires a considerable increment, in commitment 
to the assay of the file, since substantive expertise is 
necessary to define valid ranges. A data specialist can 
determine the syntactic and semantic ranges of variables and 
sets of variables from the file documentation. Data 
specialists can also compare obtained distributions and 
statistics with those given in the documentation. However, 
it is beyond the scope of a data specialist's expertise to 
say , w.hetrher or not- a particular statistical value or 
distribution is plausible given a data set'.s- domain of 
inquiry. ; 



Comparabi lity And Mapping 

Most assays of data do not include a spscific 
assessment of comparability with other data sets. (One 
- strong exception to this rule is in the case of statistical 
time series.) 

Since a m.ajor task in developing a comprehensive data 
system on the handicapped is the incorporation of data from 
• diverse sources, a special effort must be made to 

investigate the comparability of newly acquired data to data 
already in the .sy.stem. We can adopt a unitary or a 
/ pluralistic standard of comparability. The unitary standard 
is the minimum data set, which will serve* as the 
comparability target for all accessioned data. The assay of 
every file must include an evaluation of its potential for 
comparison with or inclusion in the minimum data set. At 
the begi nn ing , there may be some modi f ica t i on of 
specifications for the minimum data set so as to be .able to 
accommodate existing files. However, once the master file 
has been defir\ed, it will serve as the comparability target. 
Some sets of data will be comparable with each other, even 
though' none is comparable to the minimum- data' set. 
.Comparability may be in terms of observation, variable, or 
value. 

Comparability by observation . Two data sots are 
comparable by observation if their records are derived from 
the same unit of analysis.^ Thus, income records on a group 
of individuals are comparable by observation to health 
reco^rds on those same people. (it is assumed that the 
personal identifiers on the records can be matched.) 
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Comparability by variable . Data are comparable by 
variable when records referring to different units of 
observation contain identical variables. Mortality data 
from different cities are, comparable by variable where they 
contain information on date, cause of death, and age at 
death. 

' Comparability by value . Comparability by, variable may 
not be of any great help unless we can establish 
comparability by value. The trivial case of establishing 
comparability by value is where the pragmatics of one 
variable must be, matched to another. If two jsets of data 
record race as "white," "black," and "other," then it is a 
relatively simple matter to map the numeric values used in 
one da'ua set on to those used in the other. 

What is more common is that one set ^ of ^ values is a 
finer partition of the data space than another. If one data 
set gives fave classes of ethnicity while another records 
ethnicity as "white" and "noti-white" then the only way to 
merge ^ the two files is to aggregate the five ethnic 
categories into "white" and "non-white." 

Often the oroblem is not the fineness of a partition 
but the deg^ree to which two partitions of a data space map 
on to each other. A geographic example may help. " A 
variable called ZIP CODE may be mapped on to a variable 
called POSTAL R3GI0N by extractin^g the high-order zip code 
digit. A subset of zip codes could be matched to cities by 
extracting the first three digits. However, many three 
digit combinations do not refer to cj^ies but to areas that 
include several different post offices. 

It would be extremely difficult to map zip codes onto 
counties, because postal planners do not in general use 
counties as components in the construction of postal routes 
and regions. In fact, the only way to provide such a 
mapping would be to construct a much finer variable indexing 
towns and unincorporated areas, which could then be mapped 
onto both zip codes and counties. 

Similar considerations hedge the mapping of data sets 
f indexed by medical diagnostic category with th6se indexed by 
' impairment and with those indexed by functional limitation. 
In some cases, it may be possible to construct mappings from 
one system of indexing to another, but such a mapping will 
require considerable effort and the collaboration between 
those with, expertise in handling data and those wi th 
expe-rtise in research on the handicapped. 
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Richard C. Roi-stacher 
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Washington, DC 20036- 



December 31, 1981 

This standard sets forth technical requirements for 
data that may be included in the National Institute of 
Handicapped Researches Comprehensive Data System (CDS). 

Any data file submitted for inclusion in the library of 
the Comprehensive Data system should meet these technical 
standards. All tapes maintained in the CDS, library wiTI 
conform to this standard. Although most of these standards 
are simply good data processing practice, jome of the 
requirefnents are specific to the needs and facilities of the 
CDS* Additional information about these standards and the 
Comprehensive Data System, or assistance in meeting these 
standards?, may be obtained from NIHR'. 

Another aid to data producers is the Government 
Printing Office publication, *'A Style Manual for Machine- 
Readable Data Files and Their Documentation," which 
describes optimal practices and techniques for formatting 
and documenting machine-readable data files. This document 
is available as publication NCJ-62766 from the National 
Criminal Justice Reference Service, Box 6000, Rockville, 
^Maryland, 20850. 



Tape Recording Standards 

Data supplied for possible inclusion in the CDS should 
be on 9-trac.k magne.tic tape written at a density of 800, 
1600, or 6250 bytes (characters) per inch. Since more 
reliable encoding methods are used at the higher densities, 
6250 BPI tapes are preferred to 1600 BPI tapes, which are 
preferred to 800 BPI tapes.. The tape should be labeled with 
IBM or ANSI volume and file labels. If 9-track equipment is 
not available, then files may be submitted on 7-track 
magnetic tape. Either fixed- or variable-length records are 
acceptable, and files should be blocked to a length that 
will provide for efficient handling of the data. Physical 
record lengths of from 80 up tp a maximum of 32,767 char- 
acters-are acceptable. Experience has shown that physical 
record lengths of approximately 16,000 characters provide 
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for the -mdst efficient processing. ^ in general, data 
producers and users should consult with the technical staff 
Of their computing facility, to determine the facility's 
preferred size for physical records. The American National 
Standards Institute recommends a maximum physical record 
size of 2400 characters, which is considerably less 
efficient than the maximum permitted by this standard. 
Z'^tT^ ■ file-naming conventions should fee agreed on by 

NIHR and the data supplier before tapes are written While 
It IS, impractJical to devise a set of file-naming conventions 
that are universally applicable, it' will often be in the 
interest of NIHR and its' data suppliers to establish naming 
conventions for a particular set of data files. 

Data Type s 

_ . All machine-readable data records suoplied for possible 
pSrix^'^" will consist entirely of alohanumer ic , 

EBCDIC, or ASCII characters. (Length fields in binary 
format that are part of the IBM variable-length record are 
not includea in this restriction.) 

For the purpose of this standard, variable will be 
considerea either computational or noncomputational . 
Noncomputatio^al items contain information such as names and 
labels tnat are never used in any arithmetic or numerical 
Noncomputational variables may contain any 
EBCDIC or ASCII printing characters. Noncomputational items 
will be left justified and padded to the right with 
alphabetic blanks. ^ 



Data on magnetic tape are divided into physical and 
logical records. A logical record contains data from a 
single unit" (e.g., patient, gas bill, library catalog 
card, etc.). Data processing is often more efficient if 
several logical records are written end-to-end onto the tape 
to form a single "physical" record. This process is called 

blocking, and a physical record is sometimes called a 

block . 
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Computational items contain numeric information that is 
desiqned to be used in computations. . Computational 
variables may contain only the characters 0-9, and 
The only exception to this is where D- or E-format 
floating-point data are represented.^ Wherever possible, 
data should be represented in integer format with implicit 
decimal places noted in the. documentation. Computational 
variables should be right justified and padded to the left 
r with zeros or blanks. If a computational field is signed, 
the sign must ' immediately precede the. left-most numeric 
character. Computational fields must contain at least . one 
numeric character. In particular, computational^ fields 
consisting only of a sign or of blanks are not permissible. 
Fields containing only a signed zero are not acceptable, 
since some computers cannot represent a signed zero. 

Missing data . All variables for which there Tiay be 

missing data must have an explicit missing data \ ue or 

values. In particular, it will never be assumed tnat a 
value of zero or - a field consisting entirely of blanks 

indicates missing data. Missing data values must be 

indicated in the documentation as a list or a range of 

values, and may include zero, but may not. iilclude values of^ 
-0/ a blank field, or a field, consisting only of a sign. 

Missing data values must occur in the same field as the 
variable to which they refer. If an alternate value is to 
Be "ase'-d" in olace of a missing data value, the original 
variable must carry an appropriate missing data code, while 
the alternate value will be shown in a separate variable 
. that has been declared for that purpose. In no case will 
' the alternate value be carried in the ^original variable, 
with an explanation code in another variable. The rationale 
.for this standard is that the meaning of a variable should 
be determinable without reference to a second variable. If 
the true value of a variable has been suppressed or 
modified, then the value, of the variable should indicate 
such suppressicfi or modification. ' If -an alternate value is 
to be offered in\ such cases, the appropriate variable in the 
record .can then be read if the analyst so wishes. 



2 Floating-point data are used to represent very large or 
very small numbers. The number 2,625,000,000 is written m 
floating-point, notation as "2.625 E+09." The "E-09" is 
called the exponent, and indicates that the decimal point is 
to be shifted nine places to the right. D-format floating- 
point notation is similar, but uses a "D" rather than an "E" 
to mark the exponent. The number "0 . 00000000000465" ^^is 
written in floating-point notation as "4.65 D-12." the^ "D" 
indicates that the number was computed with double precision 
arithmetic, which allows more significant digits to be 
represented. 
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■ a' brief example may be helpful. -Suppose that the 
variable -'Number of. Occupants" is sometimes estimated rather 
than counted directly, and that estimated values are to be 
flagged. "Number of Occupants" might be assigned values of 
999 for "missing data" and "998^" for "estimated". A second 
variable, ■ "Estimated number of occupants" should be used -to 
supply the estimated value 'for cases in which "Number of 
occupants" is 998. Where "Number of occupants," the 
original variable is not .an estimate, "Estimated number of 
occupants" should haVe a value which denotes 
"inappropriate." In no case should these variables be 
organized so that "Number of occ'upants" carries both counted 
and estimated values, while "Estimated number of occupants" 
carries, e.g.;^ a 0 fo.r observations in which the number of 
occupants was counted, and a 1 for observations in which the 
number of occupants was estimated. 



File Organ ization 



■ Variables . Records transmitted for 'possible inclusion 
in the CDS should contain no undocumented or irrelevant 
fields. The width of a variable should be sufficient to 
accommodate the entire range of variation that ma^ oe 
expected' of the ijja^i^ but should not be excessive. 
Conv.ersely, fields need be no wider than is.requirpd to 
accommodate- the maximum expected value of a variable. Thus, 
the variable "number of diagnoses" for a clinical record 
probably need not be larger . than two characters, and 
certainly no larger ^than three. 

In any case, the maximum allowable field wi.dth for an 
integer variable on tapes submitted to NIHR is 15 char- 
acters. Larger numbers should be represented by supplying a 
scalina factor' in the documentation, or by the use of 
floating-point format. A noncomputational variable may be 
up^o 32,767 characters long. 

Dates . The Federal Information Processing Standard 
(FIPS) format should be used for the recording of dates, in 
machine-readable form* A date in FIPS format is expressed 
as six numeric characters, YYMMDD, where YY is the' last two 
digits of ■ the year, MM is the ordinal number of the month, 
and DD is the day of the month. In this format, February 3, 
1983, is written as 830203. Where conciseness is paramount^ 
dates may be exoressed in the alternate FIPS format of 
YYDDD, where YY is the last two digits of the year, and DDD 



' A scaling factor allows fewer characters to be used in 
representing a variable. For instance, $50,000,000 can be 
represented as "50" if the variable is defined as "Dollars 
in millions." 
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is. the drdinal \number of the day in the year* In this 
format, February 3, 1983, is written' as 83034, 

Re.cord type identif ilcation . Each 'record in a file con- 
taining more ~ than one 1 record type, e.g., a hierarchical* 
file, will carry a variable that identifies the type of 
record. The record identifier must be the first variable in 
each record, ^and' must have clie same format for all records 
in the file. Even • where! each record in a file has .a 
different length, a variable identifying the type of record 
will be included. A new value of this identifier will be 
used whenever there is a significant change of the 
procedures used to generate that type of^ record. Such 
chancj^s include changes not only in record layout, but also 
in instrument design, data 'collect ion , and coding. The need 
for»a new record type is obvious when coding instructions or 
code values, arc changed. However, even when such changes 
_cp,nsist only of the addition of new coding categories to 
existing variables, a new record type should be produced. 
Otherwise, analysts may net properly interpret the absence 
of a particular response. 

Record identification item^. Each record in a file 
submitted to NIHR must carry an identification number unique 
to the data file. If no existing variable will suffice as a 
unique identifier, then a sequence number will be assigned 
by the CDS library. Where a file contains more than one 
type of record, each record shall ^ carry a variable 
identifying the type of record, as well as a unique sequence 
number. Where the records in a file repres*eat,_ a hierarchy 
or tree, a r.ecord will carry unique identification 
sufficient to identify it and its position in the hierarchy. 
In particular, it should not be necessary to infer the 
location of a record in a hierarchy solely from its position 
in the file. 

- For example, consider a file consisting of household, 
person, and incident records. Each recorc' in the file must 
begin with the same four data items: a record type 
indicator, a household identifier, a person identifier, and 
an incident identifier. An incident record wi 1 carry a 
type and an incident identifier, as well as the ntifiers 
of the person an household to which it belongs. A person 
record will carry type identifier, the identifier ot the 
household to whi it belongs, its own identifier, and a 
dummy incident identifier. Record identifiers should be 
^positive integers. Dummy identifiers should be fields of 
zeros. Blank identification variables are not permitted. 

In general, identifiers of lower-level records need be 
unique only within leve,l, since concatenating identification 
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var^iables generates a unique-idenbH4err*-Quest-i-ons as -to 
""wKeTher sequence -numbers should be unique within record type 
or Within levels should be resolved by agreement between the 
data\ supplier and NIHR before the file is generated. 

Vrhe rationale underlying the assignment of a unique 
identifier to each record is that users of a data file 
should -be--^Terto^er form arbitrary sorts and to subset the 
file yithout requiring the use of any facilities other than 
sort program and a file-copying utility. 



Standardization .Of Data Codes 



Where possible; standard data codes should be used. 
There are no universal standards, but it is often possible 
to fir\d ^n existing and well-constructed set of codes that 
is appropriate. , "There are two reasons for emphasizing the 
adoption of existing coding schemes . ^ The^'. first reason is 
that the' a^doption of standard coding schemes^aciTitates the 
f^Htiking .of \ files. The second, and mure important, reason is 
\ that ^ the \development ,of coding schemes for variables with 
Xlarge^ numbers of categories is a considerable project in 
\tself. It is risky and 'uneconomical 'fpr a project to un-^ 
dertake the ^development of its own coding scheiAe for such 
thinf . as' \diagnoses, geographical locations, occupations, 
mental disea'^es, etc. The choice of which "standard" coding^ 
scheme is usfed is not so important as that ad hoc coding 
schemes not \ be used if equally good coding schemes are 
already in use. 

Data suppliers subject to this standard are encouraged 
to consult tl^e following sources of information on standard 
coding schemes. i 

Federal Information Processing S tandards (FIPS ) . The 
Federal I nf orii^at ion Processing Standards published by the 
National Bureau of Standards are the official Federal 
staihdards for several coding schemes. FIPS coding schemes 
should be 'considered for geographic area variables such as 
states, counties. Standard Metropolitan Statistical Areas 
(sfeAs), populated places, etc. FIPS; incorporate several 
AnMrican National Standards Institutes -(ANSI) standards 
wMch have been adopted for official Federal use. Data 
suppliers can obtain the current list of FIPS publications. 
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To concatenate identifiers is to string them together into 
single identifier. Consider a file of households with 
/people in them. Suppose that each household has a three- 
/digit identifier which is unique within the file, and each 
(person has a two-digit identifier which is unique within his 
or her household. Then person number 4 in household number 
207 is uniquely identified as person number 2070^. 
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-NBS-PWLica't-ions^Li St - tp^om^ the Office of Standards 

Administration, Institute for Computer Sciences and 
Technology; National Bureau of ^Standards/ VJashington, 
D. C. 20234, telephone (301) 921-3157; 

Bureau of the Census ^ The Bureau of the Census has its 
own set of geographical area codes as well as a continually 
revised set of occupat ion and industrial codes . • Data 
suppliers whose files .are related to Census products should 
communicate with the Dalta User Services Division, Bureau .of. 
the Census, Washington,: D. C. 20233 to receive copies of the^ 
latest Census coding schemes. 

Office of Federa:! Statistical Policy and Standards . 
The Office of Federal. Statistical Policy and Standards, 
S. Department of Commerce, serves as a central clearing- 
house for data representations and' coding schemes. Da:ta 
suppliers who- are unable to locate a satisfactory data 
coding scheme are encouraged to communicate with the Office 
of Federal Statistical Policy and Standards, 

U. S. Department of Commerces^, Wasliington, D. C. 20230, 
telephone (202) 673-7956. 



Abstract ; 

'when data files are submitted to the CDS, the produceir 
must provide both NIHR and the CDS with an abstract of t-he 
file's form and contents. The .abstract must include 
bibliographic. and technical information sufficient to meet 
the requirements of the Office of Federal Statistical Pplicy 
and Standards concerning public use data files released^ by 
Federail Agencies. The writing'of the abstract is entirely 
the responsibility of the f ile ' s producer . The abstract 
must contain all elements listed below\which are relevant to 
the form and content of the file. 



1 . Date of the abstract. 

2. Unique L-ile number (producers number) 

3. ' Bibliographic ci tat ion . 



3.1. Title . The title should be descriptive of the 
contents of the data f ile and should be 
followed by the date of the data program, if 
applicable. The title may be changed by 
..if it does not clearly delineate the file 
other files. 



*;iHR 
from 



3.2*. Statement of responsibility. The agency or 
agencies responsible fo*r the file's 
substantive content. If a contractor produces 
the ^ file, \ both the contractor and the 
contracting agency shouId""Be~'included . ^ , 
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3.3. ' Place of production. The formal mailing 
address of. the organization whichr produced the 
- . £'ile. 

. 3.4'. Producer. The organization which' actually. 
^ produced the file- This entry is^ usually, 
. redundant to the statement of responsibility. 



' 3.5. 



Date 



of^ production , The date the file became 

operational in a computier ized form and avail-, 
able 'for possible 



inclusion in the CDS. 



4-. G enera l 

4 -.a-. 



description. 
\ 

Type of f ile > 
files, aggregate 
etc. 



Types of 
data fiiles 



files irvqlude text 
, microdata f iles , 



5. 



6. 



8- 



4.3. 



Universe 



description . A 
by 



description of the 
the data in the file, 
sampling method was 



population covered 
including what, if any, 
used. • 

Subject matter description . A des^Ti^ptioh 
the types of variables included op the 
and any other comments about , the- purpose o 
the <aata collection, the s'ccpe of the study 
•or ^riy special characteristics of the study. 



of 
file 



Time coverage. 
the file refer, 



The time period to which the data 
including dates of collection. 



m 



Geographic coverage. Information concerning the total 
geographic scope of the fi]^ and additional levels of 
geographic coding prov ided within the file. 



Technical description 



A brief description of the 
file sTfucture (hierarchical , rectangular , etc. ) , the 
file size (the approximate number of 'logical records 
in the file), and ^ny information about the format of 
the file (SPSS, SAS) or dependency on specific 
computers or software. 

Reference or related materials. The bibliographic^ 
citation of all- materials that relate to the machine- 
readable file or its technical aspects such as the 
t'echnical documentation ' or, that include 
results, or tables derived from the data. 



analysis , 
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9. Related machine- readable 



files. The bibliographic 
data or software files that 



. * citation of other related 

are available to the public or qualified users. 

10 • Office of Management and Budget clearance number--. ^ .The 
number given by 0MB tb any, form(s) or plan(s) used in 
the data collection. Please indicate if 0MB clearance 

was not requires. * ^ 

^ ^ « 

11. - Terms of availabili:ty. Any restrictions on access to 
' ^the file such as privacy cert if icat^ion and* a contact 
persorTCs'^; an the producing organization who can 
discuss- the substantive and computing aspects of the 
file. The name, full address, and telephone number of 
the contact person ( s ) /must be included. 



^ Do'^cumentat ion 

— 

Each tape submi tt^d . f or possible inclusion in the CDS 
must be accompanied by documentation giving the physical 
characteristics of -the volume and files as well as the 
logical composition of each type of record. Wherever pos- 
sible, the documentation should be. in machine-readable form 
and supplied both in hard copy and as a file on the tape. 
Machine-readable documentation is preferred because it 
prevents the separation of documentation from .data and 
because the physical quality of the documentation will not 
be degraded by repeated copying. 

Machine-readable documentation should be in printer 
image. If a document processing program is used to format 
the. source text, it is^requested that the source" text of the 
documentation and a reference to the text .processing 
.software be included as^ well. (This sta^hdard makes no 
' speci f icat ion .of any particular document processing program, 
nor does it require, that such .a prog^ram be used- at all.) Thfe 
minimal documentation of a data file consists of a tape 
volume table .of contents, a character and octal or 
hexadecimal dump of a sample of records of each file, infor- 
mation sufficient to construct a bibliographic citation to 
the. machine-readable data file,^ an abstract of the file's 

form and content, and a minimal codebook. 

j 

' Where appropriate, data files ,should be accompanied by 
copies I of the original collection instruments, including 
survey quest.ionnaires and interview schedules. Copies of 
editing and coding instructions used in the creating the 
data file should also be included. 
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,Tape Table Of Contents 



/ 



TJhe tape volume table of contents lijsting siiould 
include all information fWom the volume label and from the 
file Isabels. /Information should be in an easy-ljto-read form, 
I At^mr^ rsf i-Ko i- o v ^ of h l^hpl<;_ The following 

cpnbents 




rather! than ay dump of the text of the labels, 
figure is an Example of a table volume stable, of 
listing. ^ ' * ; 

It possible, the tape table of contents shouid\be 
producfed' by a program which also veri f ies'TTTe -readability bf 
the tap4. If no tape listing program is available, then t^ie 
tape table of contents should be projiuced manually, ^^ing 
information from the job which produced the tape or a duni^' 
of the tape. (Since tape listing programs/ are becoming 
increasingly available, it is suggested tHat data centers 
withoutji such progra'ms. attempt to acquire them.) ; \ 



Minimal Codebook 



The codebook included with the 
-least ithe following information for 



r 



file must contain at 
each variable: 



l.» A reference number. 



\ 2. An unambiguous name for the item. 
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* Is. \h textual description of the item, or the text of the 

^' 4^estidn, if from a questionnaire. 



4. The starting location, > width, Ibcation of implicit 
'dec'iTffal point, or scale factor. 

5. Missing data codes and their meanings. 

6. ^The mode in which the variable is^ represented, i.e., 
numeric character, alphanumeric string,, floating-point 

binaryj etc. 

f ;^ 

The codebook must also contain a Mst' of the valid 
laluesi for categoridal items and valid ranges for continuous 
item^. Missing data codes must be documented in the same 
ashion as other values and not left implicit. 
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Frequency Tables ' . . 

.A frequency distribution for each categorical variable 
must 'accompany each file submitted for. possible inclusion in 
the CDS. The mean, standard deviation, range and number of 
cases of continuous variables should also accompany the 
file. Values that fall outside of those defined in the 
codebook should be annotated if they cannot be corrected. 
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IBM-LABELED 6250-BPl 9TP V0LUME=RSW0O9 OWNER^^CAC .U-I LL RACK;I^=C4524 
LP'ON BLK=ON RING'OUT OTCHK=ON RETRY=10 



FILE 

It DATA SET NAME 



BLOCK RECORD TAPELTH 
COUNT COUNT (FEET) 



1 NCS73.NAT.C01 

2 NCS73.NAT.C02 

3 NCS73.NAT.C03 

4 r:CS73.NAT.C04 

5 NCS74.NAT.C01 

6 NCS74.NAT.C02 

7 NCS74.NAT.C03 

8 NCS74.NAT.C04 
TOTAL TAPE LENGTH » 

•><*><*;^ END OF TAPE 



1233 
1234 
1 187 
1212 
1086 
1083 
1072 
1 100 



125858 
126442 
123279 
125436 
1 12135 
1 12086 
1 1 1090 
1 13455 



2114.07 FEET 
<*><*><♦> 



283.07 
283. 17 
272.60 
278.21 
249.39 
248. 84 
246. 18 
252.62 



RECORD 
FORMAT 

VB( 15250 
VB( 15250 
VB( 15250 
VB( 15250 
VB( 15250 
VB( 15250 
VB( 15250 
V8( 15250 



305) 

,3or ^ 

301 
305) 
,305) 
305) 
305) 
305) 



BLOCK LTH 
AV. MAX. 



15141 
15135 
15146 
15139 
15139 
15148 
15138 
15*14 1 



CREATED EXPIRES USER 
DO n:mm yy do MMM YY I.O. 



15250 
15250 
15250 
15250 
15250 
15250 
15250 
15250 



23 MAR '77 
23 MAR '77 
23 MARi^77 
23 MAR '77 
2 J MAR '77 
23 MAR '77 
23 MAR '77 
23 MAR 77 



BATCH 
RECEIPTS 



SGDA 




SGOA 




SGOA 


^91127 


SGDA 


691 127 


SGDA 


691 127 


SGDA 


691 127 


SGDA 


691127 


SGOA 


691 127 







Figure 1: A tape volume table of contents. 
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A Data Access and Archiving Policy 
For the National- Institute of Handicapped Research 

Research, Memorandum Number 4 



Richard C, Roistacher 
Bureau of Social Science Research 



Washington, DC 20036 



December 31; 1981 

The National Institute of Handicapped Research has the 
mission of coordinating research and information on the 
nation's handicapped. Policy r:l3 of the National ^ouhcil on 
the Handicapped ' s Polic ies Governing the Nat ional Institute 
of Handicapped Research states: 
^ - I ^ 

Strategic support goal . In order to advance 
its goals for re£5earch and training, NIHR will 
initiate the development of a core capacity for in- 
house data management, knowledge transfer, 
technology evalua t ion and assessment , and publ ic 
education desijgned to assrre more effective planning 
and to accelerate translation of research results 
into pract ice by consumers , providers , and the 
public. This policy should be implemented in 
conjunction with. Policy #32. 

« 

Jn order^, to manage data and transfer knowledge, it is 
necessary that NIHR have a policy concerning data access and 
archiving. Without such a policy, data collected atS NIHR's 
expen^se may be lost to further use after the data's original 
purpose has been ful'filled. Much of the data on the 
handicapped is what can be called operational data. 
Operational data are subject to continual change as they are 
used to represent a current , state of affairs. However, 
large amounts of ^tja-tic-^data are also collected. Static 
data are colTected during a finite period of analysis and 
^are"then analyzed in a completed form. 

Some static data are collected on a recurring basis as 
part of legislatively mandated statistical series. Others 
are collected as part of large one-time studies, while still 
Others are, Collected by individual investigators under grant 
or contract. 

In spite of the enormous amounts of money and effort 
invested in their collection, r 'st data are analyz.ed only 
once and then forgotten. Data, are left in whatever [. 
condition they were in at the time the initial analysis was 
completed: data and documentation are often destroyed or 
mislaid. 
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An inc^reasing amount ol emphasis is now being placed on 
"s.econdary analysis," research on data collected by others 
and for other purposes* Secondary analysis not^ only 
provides for economy, . but also allows the testing and 
verification of earlier conclusions and the conduct of 
comparative research, even where such research was not 
contemplated bv the original collectors o^ the data. 

The policy and scientific potential of secondary 
analysis, as well as it's great economy,, make it important 
that all government data collection and analysis policies 
include a consideration of data archiving, secondary access, 
and secondary "analysis** Indeed, present federal law 
provides that copies of all federal data sets be deposited 
with the National Archives. However, unless policies, 
standards, and procedures for • implementing archiving and 
secondary access are produced, the depositing of tapes will 
be of little use to future researchers and analysts. This 
paper suggests a data archiving and accessing policy for the 
National Institute of Handicapped Research. 



The Life Cycle Of Research Data 

Research data begin as the responsibility of their 
collector. Even though the collector may be v;orking under 
the strictest of mandates, it is impossible for. -the 
supervising agency to maintain any supervision of the actual 
data. Generally, a primary collection of data is followed 
by an assay of the file resulting from the primary 
collection. Where the assay shows systematic deficiencies, 
.the collection methods are altered and collection proceeds 
in one manne^r, pr^anotrhei:' To a conclusion. .Most of the time, 
the-sponsor's only contact with the data is through a 
report. • In some cases, a copy of the data file may go back 
to the sponsor for in--house analysis. In a few cases, the 
sponsor' and the data collector collaborate in a joint 
analysis of the data. 

In most cases, once the report has been written, the 
data are forgotten. However, if proper policies have been 
implemented, data and documentation can be passed at 
relatively modest cost from the original analyst to a data 
preparation facility and thence to'' a final repository. In 
some cases, the sponsor may wish to use data after the 
conclusion of the original analysis project. In other 
cases, the sponsor-^ has no immediate use for the data but 
wishes to' make them avai lable to others The original 
analysis project should leave data and documentation in 
reasonable order and ready for at least .minimal processing 
by an archival facility. At a minimum, the archive will 
assign bibliographic control information to the data and 
documentation, and copy them to an archival storage medium. 
In some cases, the archive facility may perform extensive 
assays and recodes of data, or may write extensive 

S3 / 
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;additional documentation. However, data collected under the 
aegis of a good secon'dary access policy should require 
relatively little work from the archiving facility. The 
data's permanent rep9sitory can be either an archive or a 
library. An archive provides prospective users with access 
to the data, but does not provide analytical services. A 
data library provides both data and services. In some 
cases, services are limited to reformatting and sUbsettihg 
of ^data and documentation, while in other cases a full range 
of analytical services is offered. 



Primary Data Use NIHR 

The National Council on the Handicapped' s Policy #32 
states : • 

Data system . NIHR will develop a national data 
system designed to be 'comprehensive* from the point 
of view of NIHR's mission. NIHR will "^develop 
capacity to maintain access to, evaluate, utilize, 
store, and retrieve, on a cost-effective basis, 
selected primary data and data on other banks 
relevant to the mission of the NIHR. Such data vill 
include, but may not be limited to demographic and 
related data on handicapped persons and data on the 
results of research. NIHR will take steps to make 
such data available, usable, and accessible Xo 
investigators supported by the NIHR and other 
Federal agencies, and, as appropriate, to consumers 
and providers and their organizations. NIHR may 
charge reasonable fees for such services. 




\ NIHR thus is mandated to be the primary user and an 
archival facility for its own data. However, it is neither 
necessary nor optimal for, NIHR to be responsible for all 
secondary use of its data* 

o 

There are five major categories of secondary users for 
NIHR data: NIHR itself; other government agencies; colleges 
and universities; industrial and public users; and 
posterity. ' For each of these categories of users, there is 
a primary depository for data. NIHR itself can maintain 
temporary control of data which are of current interest. 
Other government and public users would probably use 
National Technical Information Service as their supplier for 
government data. The Inter-University Consortium for 
Political and Social Research is the country's largest data 
and archive and di'stributor for academic users.- Finally, 
the National Archives is the legal repository for government 
data' and can provide for their redistribution to all 
categories of users. 
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Responsibili ty Foir Static Data 

Dynamic data are those which are undergoing continuous 
revision, either because they are part of an on-going 
collection effort, or because they are designed to reflect a 
changing, \situation ; • While the policies outlined here are 
applicable to dynamic* data, such data are so strongly 
affected __bZ_local technical factors that a*general ^ata 
policy mSTst be"'^applied .with ,,extreme care. However,, at some 
point a aynamic data base will become static, ei ther ^b.e.ca^usje 
the collection effort has ceased or because the data has 
been abstijacted from the dynamic data base for research or 
statistical purposes* 



f 

( The Processing Cycle 

A research^ data set passes thrdugh primary archival, 
^and secondary phases of processing. Primary processing is 
done by the collection or analysis organization. Primary 
processing should produce two classes of products: research 
reports and a file of data suitable for archival processing 
or secondary use. A NIHR policy on dara archiving and 
access should insure that investigators produce a machine 
readable data file and accompanying documentation which 
conform to basic standards of syntax and style. 

— -^Archival processing includes th^ assay and editing of 

data, preparation of documentation in a form suitable for 
distribution, the addition of bibliographic identity and 
control* In some cases archival processing may include only 
a quality control check and the addition of bibliographic 
information. . Large data collection projects may do their 
own archiVal "processing; small projects may do some archival 
processing, but will seldom be able to write their own 
bibliographic control and cataloging information. Archival 
processing ends with the transfer of the machine readable 
data file and documentation to a permanent depository. 

The NIHR data archiving and access policy should insure 
that data , documentation , and bibliographic control 
information conform to appropriate standards and that data 
files are deposited in the appropriate archives. 

Secondary processing is done as part of analytical 
efforts independent of the initial analysis. A NIHR data 
archiving and access policy should be directed to insuring 
that machine readable data files, documentation, and 
bibliographic information are disseminated to the widest 
possible audience. 
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The policy should insure that to the greatest extent 
possible, "secondarv analyses can be carried.. out without 
assistance from the archive, from the primary analysis 
organization, or 'from- NIHR. 



Tasks And Responsibilities 

A data archiving policy must assign responsibility in 

five major areas, of data" processing and use: setting of 

standards, quality control, processing and analysis, access, 
and archival deposit. ' , . 

• S etting -of standards . NIHR must be responsible for 
setting of data and -documentation standards' for both primary 
and secondary analysis. independent investigators should be 
provided with a set of minimal data^ and documentation 
standards included as _Bar_t^o£-^t-heir research grant or 
contract condijti-ons. ^At the end of the grant or contract, 
„they- — shouTd"provide NIHR with a tape of their research data 
accompanied by minimal documentation. Large contractual 
projects may be required to meet data and documentation 
standards up to and including preparation of complete 
archival files. The standards set by NIHR durfng the 
primary "and archival processing provide the basis for 
standards during secondary processing,^ since the initial 
standard can only be maintained or degraded. 

Quality control . During primary processing, 

responsibility for quality control of data rests entirely 
with the collector or analyst. Maintenance o£ data and 
documentation- quality is intrinsic to the conduct of good 
science and cannot be delegated. 

During archival processing, the archive facility may 
conduct additional , assays ' of the data, both to serve as a 
reject on the original collection effort and to correct 
errors which may have evaded the or iginal analyst . 

Processing . Processing for analysis is intrinsic to 
the primary analysis and has no analog in the archival 
phase. .A data archiving and access policy should be 
directed to seeing that methods of analysis are disclosed in 
order to facilitate,%replication and comparison. Analytical 
methods should be documented in research reports and 
publications. 

Access. During primary processing, data are usually 
■ the intellectual property of the collector and analyst. In 
the case of large contractual efforts, NIHR may consider the 
contractor to be an agent, with ownership of the data 
residing directly with NIHR. During the primary analysis, 
access »should be at the discretion of the data's owner. 
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After analytical processing, either an active of 
passive access to data may be provided. Passive access to 
data is much like that provided by a library. .Data files 
and documentation are deposited in public facilities, and 
bibliographic information added to catalogs, information 
retrieval systems, and publication of data availability. 
However, the depository does nothing more than , provide 
copies of tapes and documentation on demand. Active access 
to the data is provided by a facility which augments passive 
access by performing data processing and analysis tasks at 
the direction of "^clients. Active access to data requires 
not only additional skills on the part of the access 
facility, but also a continuing contractual relationship 
with clients. Passive access can be provided on^ the basis ^ 
of one time fees for the copying of tapes and documentation. 

Archive deposit . A data access and use policy should 
be directed at ensuring that copi.e^, of machine readable data 
files and documentation are deposited 'in institutions which 
will provide them to the widest possible appropriate 
audience. At the end of the primary processing, data should 
be deposited with an archive processing facility under 
contract to NIHR. The facility should have the mandate to 
assay data and to demand information necessary .to improve 
f-iles which do not meet the prescribed standard. Primary 
collectors and analysts should be required to respond to 
requests of primary archiving facility* 

After archival processing, machine readable data files 
should be deposited in appropriate archives. At a minimum 
these archives should consist of a machirre'^eadable archive 
division of the National Archive^s and Records Service, ~ the' 
National Technical Information Service, and the Inter- 
University Consortium for Political and Social Research. In 
addition, bibliographic information concerning data and 
documentation should be disseminated to the library of 
congress. Such a dissemifiat ion policy will make NIHR data 
available to the widest possible audience, while at the same 
time relieving NIHR of the task of providing continuing data 
support to secondary analysts. 
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• ; . < * NATIONAL/FEDERAL 

DEPARTMENT OF AGRI CULTURE ■ * . " ' '\ 
- Nationwide food Consumption Survey (1125) 



/OEPARTHENT OF COMMERCE • . 

• i . ' 

'# Bureau, of the C ensus^ . ^ ■ 

^ , . ^ 

- I98O Dec\innl;al Census of Rqpulatjon and Housing (1020);.^,^ 

• 1970 Decennial Census of Population and Housing (1025) 
^ Current Population Survey (1021 - 

- Annual Housing Survey^ (1622)' ^ , . - ' ^ ^ 

• 1976. Survey of Income and Education (1023)'''' • ^ ' . ' • 

- '1967 'Survey of Economic ,Opportuni ty (IO66) 
^•Continuous Lorigi tudinal .|Manpov;sr Study (CETA) (1085)'"' 

-Research Pandl., Income Survey D.evelopnlent Program,- 1978 and 1979 (1127) 

- Post-Census Disability Survey: Richmond Pretest * (102^) 

DEPARTMENT OF DEFENSE " j . '' . 

t " ^ ' 

e Office of Civilian HealtH .\ , " " 

• - Medical Program of the JUni formed Serviies (CHAMPUS)' Data Base (1175) 
DEPARTMENT OF EDUCATION [ . _ 

^ Bureau of Education -for— the Handicapped 

- National "Speech and HeaHng Survey (1083)"*' , ' F 

' U Handicapped Children's prly Education Program (13Q1>^ * 
^- Bureau of Education for the Handicapped Qounts (1166)^ ^ * 

^ Offi ce of. Special Education ^ ^ 

• 1 ^ : . 

Child Count Data Base (13p2) 

# ftet^abi li tation Services Administration 



•rvState -Federal Prbgrarn bf Vocational Rehabilitation (llOlO 
T.". ► National Spina^l Cord Injury Data Base ( 1 109)'"''" 
r 197^ Sheltered Workshop 



Survey, Green lei gh Asso. (1182) '» 
- Survey ofoHearing Impai^d and Mentally Retarded Persons in 
State. Institutions (1170) ^ ■ * 

Medicaid Impact (II60). , ; 

irvice Needs ^udy ("Urban Institute for the 
Admi.nistration) (10^f6 to 10^8, see below) 



- 1969 National Survey of 
— 1973-7^ Compr-ehensi ve S 
•Rehabi' 1 i tation Services 



- Vocational Rehabi 1 i tnti^Dn Rejection purvey (10^6)'^'^ 

- CMRC Service Recipient 'Survey (lO^*?)"""'' . 

- S'ujivey *of 'Providers of Rehabilitation (10^8) 



RSA^;\ Datd Link (1027)- / 



^Cover page avai ]able# 
Detailed description available. 
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n.-n.. nf Pl.nnma, BuW , ^"H f' n1 nnt ■ on/Of f i cc of Education. 
■ -^ Assessment of Seleaed Resources for Severely Handicapped Children 

. -and Youth: (1086) ■ x 

. ■ - Project VO-ED: A National SurveylMb^) . 

■ A^mi n t s t ra 1 1 on on De ve 1 o nmhn ta 1 ■ P i sabl 1 1 1 1 es 

■-•State plans for Developmental Dlsabllit|esjXl3ji8) : _ 

OEPARTHHNT. OF HEALTH AMD. HUMAN SERVICES (,DHHS)^ • * . 'I 

. Administration of Aa mq • • 

' Americans (lUlM ■. , . • . 

■ —l''''°"'\lZ7cXTlTnl ndian Elderly (1126) 

. : SltrrLUttetlp Fro: ;;; ^nend, (Two National Telephone Survey. 

of the Elderly) (1130) ; • , 

. ft Health Care Financing Admini stration . ^ •/ 

''^:?:irand'pe^od!: ^renlng,. Diagnosis, and Treat.en.t:Da.a-(-l(7l) ' 
^ H..m;,n Develoom <-nr <;p.rvi ce-AA^i ni sf ration of Children, 

Youth, and Fami.lics " - / i m - " 

■ ^Annual Surviy of Headstart Handicapped ^fforts (1973 to ^f^^^^ ll^^^^o.o')^^^ 
■.National Survey of Agenc-i.^s Serving the Deve opmenta^^ ly D.sabled (105u) , 

Cr»riWl <;prv;ces RcportTng equiirements (1094j /,1Vn^ « 

: ^orio^upltuo? of'pbrme? .cadional Rehabilitation Cl.ent, (U120) 

- Dev elopmental Disabilities Uoject on Residp.nti^l "Services and 
'Conimunity Adjustment (irTS,:?-.v ' . . ' 

• bf-fice for C ivi 1 Rights . 

- Survey of Indivi'dlliTTiTrResidential Institutions (in planning) (1061) 

CKildren and Youtji Referral Survey (130i4-)"'" * . 

- Elementary and Se'condary School Civil Rights Survey (11.65)" 

' - Speci-al 'Purpose Faci I'ities (1306) , \ , , • \ ' \ 

- Survey of Impaired Individuals in Households (1062) (In Planning) ^_\/;) 

dHHS—PUBLIC HEALTH SERVICE,' , / • ' _ 
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• National Center for Heal'th StatiCstics / 

-'National Survey of Personal Hea i th Pract i ces. and Consequences (1006)"" 

- Health Interview Survey '(HIS) (1001)"""' • ox--— ' 

- I;lealth aiid Nu' i t ion, Exarrii nation Survey (HANES) (1003)"" 

- The National hospital D i scharge. Survey (1007) , nnQ\ 

- Resident Places Survey.I- Nat'l Survey of Nursing Homes - 1963 JOOKJ ' 
-Resident Places Survey I I - ^ ^ ' " 

- Resident Places Survey III - " " - ^9^9 (.010)^ ^ 

- 1969 .National' Nursing Home Survey (1011) ' 



- .1*973-7^4 National Nursin'g Home Survey (1012) 
-'-1977 National Nursing Home Survey nniM"" 

- 976 Survey of institutionalized P<^;;^°"^ j ° survey (iri2)^^*' 
• ^ IJyo Health Services Utilization and Expenditure Surv y 

Master Faci"nYies"Survey -(TO 19) ° • . 

-Health Services Utilization and Expenditures \\}_}/-) 
-* 1975 National Ambulatory MedicaT Survey (1015)"" . 

■ . National- Natality Surveys (10l6^•"• , 
-National I nfant Mortal ity Suryeys( 1017). 

-National Mortal ity Surveys (1018) • . . 

-Congenital- Anomalies and.Birth Injuries Among Live Births: United , 

States. 1973-7^ (ll8i+) 
Health Resources Administration 

- BCHS Reporting Requirements (1028) 

■ -BCHS Medically Underserved Area Data Base (1029) 

-» ■ e 

p^ l^ea^t:h Servi ces 'A'dini n i strati on - . . - . - 

■ -.Indian Health Service Health Care Statistics System (1308) 

O rf^nter for . ni'^pase-Control • 
■ . Birth.Defects Monitoring Program (BDMP) -(1309)" 

♦ W..tiona1 Centpr for Healtir Servi-ce s Research , ^ 

- 1977 National Medical Care Expenditure S'urvey (1073)"" ,,,,.n.v.v 

•-Analysis of^National Survey Data on Access to Medical Care [\02b) 
ADAMHA 

. Survey of Patient Cha'rac^r isti cs-S,tate and County "-"^^^ "°^P;^^^^ 
.. iample Survey of Addi tions. to "I npati ent Services-State and County 

I^npalnt Additions-Private Psychi atr i c Hospi ta Is (1202) 
llample Survey of General Hospi tal . 1 npati ent Psychiatric Unit 

.?n:ent;?rpf^?-Prehensive Co..unity Mental Health Centers (120.)^^ 

- Inventory of Transitional Mental Health Fac. 1. ties (1205) _ 
. Sample Survey of Additions to Outpatient Psychiatric Services . 

or Clinics (1206) ' . . . /w . /^onQ^ 

. inventory of General Hospital Psychiatric Services (209) 

- Sample Survey of Additions to Partial Care Programs (1210) 

- Sample Surve^ of Clients Under Care in Outpatient Psychiatric 

JwirLrvey^l^f^cn^nt^^ Care in Partia 1 . Care Programs (1212) 
: la^ple SurTy of Admissions to Veterans Administration Medical 

' . Sa:prsurvlf of" Discharges from Residential Treatment Centers for 
EmotionalTy Disturbed Children (121^)^ 
. Biomedical and Behavioral Study of Aging (H^/^ . 
-The Epidemiologic Catchment Area (ECA) Program (1183) 



• Inventory of Mental Health Facilities (1208) 

• Mental Health Demographic Profile System *( 1207) . 

DEPARTMENT OP HOUSING AND URBAN DEVELOPMENT 
r._-.XongLtudi.naI_(xen-yea.c)„S.urvey_ojLHojj^ (111 .plan" '"9). _.L. 

■(^059). , ,v ^ - 

« Public Housi-ng Reporting System (IIOh) 

DEPARTMENT OF LABOR 

- Rearranged Jofk Schedules for Handicapped Employees in the 

- Private Sector (iOi»5) ^- . . . •; 

. 1973-76 Sheltered Workshop Study (IO5I) . . . ' 

-National Longi tudi.nal Surveys (1105) - '/ , \ ' 

• Survey of Disabled Veterans of the Vietnam Era (10^5) 

- BLS OccupationaTl njury and 11 Iness ..Records ( 1 1 69) 

DEPARTMENT OF TRANSPORTATION • ■ • ^ 

- Sur.vey of Transpprta_tion Problems of Transportation Handicapped 
Persons in Urban Areas (1129) . " i • 

■ - 1976-77 National Survey of Transportation Problems of Handicapped 

Persons (1097) ' . 

ACTION ; ' 

- Senior Companion Program (SCP) Evaluation (1128) 

- Foster Grandparent Program Evaluation (1151) 

LIBRARY OF CONGRESS , * - ' . 

- 197iE| Brai 1 le Reader Survey- (I050) 
OFFICE OF PERSONNEL MANAGEMENT 

- OPH Central Personnel Data File (CPDF) (1093) _ 
SOCIAL SECURITY ADMINISTRATION 

• Continuous V/ork History Sample (1030) . . - 

' ' - Longitudinal Sample of Disability 1 nsurance.App 1 i cants (1031) 

- Continuous Disability History Sample (lQ3j2)"" 

- 1969-79 Retirement History Study (1033)"" ^ - 

- 1966 Survey of Disabled Adults (103^)'''' 

- 1967 Survey of institutionalized Adults (1035)'^'' 

- 1968r70 Survey of Mev/ly Entitled Beneficiaries (1036)'-" , , 

- 1969 Followup of Disabled Adults (:iP37) 

- 1971 Survey of Recently Disabled AdQit (IO38) 

- 1972 Survey, of Disabled and Nondisabled Adults 0039)"",^,^ 
. - 197^* Follov/up of Disabled and Nond i sabled . Adu 1 ts (jO^O)-' 

l97/f Survey of Low Income, Aged, and Disabjed (lOUl)"'* 

• 1978 Survey of Disability and Work (lO^tZ-)"" 
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- 1978 Survey' of Blind and Oi sab led Chaldren Receiving Supplemental 

Security Income Benefits ( 10^*3) /..^^n 
. Supplemental Security Income (SSI) Program (HOS) 

— ftet-i-rement, -Sur-v-i-vor-s.,- .and-DxsabJJJ-ty„J,ns.uran.ce..aaxms_SX^^^^^ 1Q,2X 



- Survey of Income and Program Participation (in planning; 



(1060) 

- AFDC Recipient Surveys (ll?^)'"*' ui ^ 

- Feasibility Assessment for a Longitudinal Survey of the Disabled 

• (In planning) (1155)' , ' . „ u u-,-. 

• • -Alternative Practices in Disability Beneficiary Rehabilitation 

(In planning) (115^) - ~. / 

- Mork Incentive Experiments (in planning) (1153) 

- Mec cal Examination Study of Impairments and SSDI Eligibles" 
(in Planning) . (1152} ^ " 

VETEP-ANS ADM I N 1 STRATI ON 

- Veterans Administration Annual Patient Census (IO89) 
" Housing Assistance Program (1106) 

"- -Condensat-ion- and-Pensi on. Reporting System (.•'07} 

- 1971 VA/Harris Survey of Vietnam Era Veter.ans U 7^; 
> 1979'-A/A-/Haws-Su rv e y o f Vif .tnam Era Veterans (1173; 
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■T " ^ATT on a L/N on - ? fPERA L 

o 

t American Association ofi University Affiliated Programs 
,-^Amer.ican-^Associat ion-.of -Uni^vers i ty Af-f i^l i ated Programs- Data -Base (2315)- 

# American Foundation for the Blind (AFB) 

- Print-Limited Population^Survey (2312) 

- Consumer Needs Survey of the Visually Impaired (2311) , 

m Association of State and Territorial Health Officials 

-i National Public Health Program Reporting System (2317 and 2318, see belovv) 

- Inventory of Programs and Expenditures of State ^nd Territorial 
Health Agencies (2317) 

- Haterna! and Child Health Supplement (2318) 

t Epilepsy Foundation of Amer : ca J 

- Basic Statistics oh the Epilepsies (2322) 

# Gallaudet College ^ 

- Annual Survey of Hearing Impaired Children and Youth (2098)'*" 

- National Hearing Impaired Achievement Testing Projects (2099) * 

- Survey of Families of Hearing Impaired Children (2305)" ^ 

6 N ational Association for the Deaf 

- American Annals of the Deaf (2319) 

- National Census of the Deaf Population (2310)"'* 

National Association for Retarded Citizens . * • \ 

1971 Survey of Mentally Retarded Students (2180) 

Followup Survey of Mentally Retarded Citizens (2181) 

0 National Association of State Directors of Special Education 

- State Profiles in Special Education (2332) ^ 

0 N ational Committee for^Citizens in Education 

- Survey on Parent Involvement in the 1 EP Process (233^) 

» National Easter Seal Society ' 

- National Easter Seal Society Program Services Reporting System (2329)" 

f National Tuberous Sclerosis Association, Inc. 
r Xut>^'"ous Sclerosis Medical registry C2335)'' 
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United Cerebral Palsy Association, lnc >, 

- Programming for Atypical Infants and Their Families (2331)*^ 

- Data. Retrieval System (2303) 



PaValyzed Veterans of America 

- Survey of SCI Veterans '13 (2185) ^ c 

Other \ - V 

- Birth-Defects /information System (BDIS) - Tufts-New rEngland Medical 

Center <2323r" ' • - 

- Institute for Chi Id Behavior Research. (I CBR) Data Bank— I nsti tute for 
Child Behavior Research (2328) , , 

- 1972 Ohio State University National Survey— Ohio State University (2065)'* 

- Idiot Savant Survey— N.Y, State Institute for Basic Research (207J) 

- National Survey of Handi cappi ng Conditions Found in Juvenile Correc- 
tional Institutions -,S.C. Department o'f Correctional Institutions (207^) 



* Mtim GEOGRAPHIC AREA 



• Hultis^ate Information •(Mental Health) System-DHEW (306^.)^ - 

. 1968-71 Baseline Surveys of Urban Health Center Service Areas ' (3096) 

• RAND Health Insuraftce Study (for DHEW) (3108)*"""' 

.Multistate" (California. Colorado, Nevada) Survey of Residential Status 
of Developmen tally Disabled Persons (30^4; 

•Experimental Health Services Delivery Systems Family Health Survey 
(for DHEW) (3111) . . 

• collaborative Perinatal Project, .National Institute of Neurological 
and Communicative Diseases and Stroke Ki^m 

CoTiputerized. patient Profi le System. National Foundation of Dentistry 
for the Handicapped (33^0) 

.^||;dividual Data Base (IDB) Henta^l Retardation Research Center (3341)^- 

• Juvenile Rheumatoid Arthritis Da\a Bank. Special Treatment Center for 
Juveni le ArthriUs (33^2)" 

■..Kent infant Development (KID). Scale Protocols. Kent Development Metrics. 
'Kent State University (3343) ^ • , . . 

New York Hospi ta 1 -Cornel 1. Med i cal Center ^^^^^^^^^^ 
Hemophilia Diagnostic and Treatment Center Datg Base (3344) 

. Patient Registry for Actual or Suspected Epilepsy. Compr^ehensi ve ' 
Epi lepsy Program (33^5)" 

. Protection and Adcocacy ,(P ^ A,) Data Retrieval System. Institute for 
Comprehensive Planning (3346) 

. Southeast Registry of ^"^-^1 ' "J,^^'^^-" .f.J?"^'' . 
Regional Center for Deaf-Blind Children (33^7) 

. •1975 Vol lowup of 1968-71 Baseline Survey of Urban Health Center Service 

Areas— NCHSR (3-162) 
. Long-term Care Reimbursement Experiments-Evaluation of Homemaker and 

pay Care Services-NCHSR (3147) . 

• Cost .E.f fects and Benefits Associated wi th Domi ci 1 i ary Care and Inter- 

. mediate Care— Administration on Aging ^3 I I/; • . . 

i Informal Social Networks in Support of Elderly Blacks in the Black ^ 
Bel.t^--^Admi.ni;S.tration on„Agi-ng. A3122;. - - 

• 'independent Lining Rehabilitation (ILR) Projects-Urban Institute (3177) 
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ALASKA . f ■ ^ 

- Registry of Deaf-Blind Children (^^01)" 

ARIZONA . 

-DES-DDDMRS Client information System {kh02)" 

ARKANSAS* " 

- Crippled CHi Idren' s Services Programs (^^03)^'* 
• - Service Management Project (A 133) 

CALIFORNIA / ^ 
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COLORADO . . ^ 
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FLORIDA - 
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■ ^\ ' ■ • '' 

ILLINOIS , • ' 

- Developmental Disabilities Information System (^^07)" 

IOWA _ . ._ . 
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- Services for Handicapped Children- and Youth (4408)" 

KANSAS ^ ' . , 

- Adult Restorative Servi ces--Admi ni s trat ion on Aging (4131) 

KENTUCKY • ' 
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■ . and Handicapped (4075) ' - ■ 
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MASSACHUSETTS • .. ' 
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- Division of Fami-^ly Health Services Cli.ent Data Base (4410)" 

- Massachusetts Service Coordination B^attery (4411)" 
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- Services for Children with Handicaps Management Information 
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NEBRASKA ■ * ^ 
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Ohio'' 
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DISTRICT OF COLUMBIA- • ' ^ 

- The Mental and Nervous Disorder Utilivzation and Cost Survey (5216) 
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MICHIGAN 
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MINNESOTA " ■ ^ 
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Elderly, Phi ladelpliia (51^9): ' ^ . 
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The Definition and Measurement of Disability 
A Selected Bibliography 

This bibliography was prepared as background material for 
Research Memorandum One and is riot intended as exclusive 
coverage of all the materials available in this area. 
Materials chosen simply reflect the purpose of the first 
phase of , the comprehensive, data system pro ject: to review 
the thinking of researchers, policy-makers,. and 
practitioners on how to conceptualize, classify, and measure 
disability. Emphasis is on disability in the generic sense, 
across traditional diagnostic and impairment categories. 
The following and somewhat arbitrary classification was used 
to organize the bibliography: 

"* ft 
I. The Conceot Of Disability — General 



A. . Conceptualization And Classi f ication 

Badiey, Elizabeth M. "Implications of classification 
schemes for information flow in caring systems." Paper 
.presented at symposium of Society for Social .Medicine, 
.September 1976. 

Berkowitz, Edward D., ed. Disabilit y Policies and' 
Governmen t Programs . New York: Praeger, 1979. 

Several chapters focus on definitions . and 
classifications. See especially those by Saad Z. Nagi, 
Edward D. Berkowitz, John J. Dempsey and Jerry 
L. Mashaw. 

Coet, L. T. "Defining term handicap— function of sex, race, 

rel-ig-ion; and geographic location." Psychological 

Reports 41:783-"8'7, 1977. 

• 

deJong, G. Movement for Independent Living; Ideology, and 
Implications for Disability Research (Occasional paper 
ho. 2). Boston: Tufts University Medical Rehabilitation 
Research and Training Center and East Lansing: Michigan 
State University, Center for International 
„..-_Rehabil.i.ta.t.ion.,^.19.7.9. - 

Granger, Carl V. "Fellowship report [to World 

Rehabilitation Fund]." 1979. 

Fellowship project explored British approaches to 
disability terminology development, measurement of 
.severity of disability and uses of epidemiologic methods 
for investigating disability. 
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Uaber, Lawrence D. ' "Disabling effects of chronic disease 
and impairment." Journal of Chronic Disease^ 24:469-87, 

a971. • ~^ 

.Another .edition appears as: "Epidemiological 
factors in disability: I. Major disabling conditions." 
"In Compilation of Social Security Survey of the Disabled 
Population: 1966 TReport no. 6). Washington: 
U.S. Social Security Administration, Office of Research 
and Statistic?, 1977. 

•Haber Lawrence D, ''Haber Session." In transcript of 

- WofR^hop^" C o n f er eTic e — on— D is a b i-H~ty — (-Sa n— - J ua n.,. — L9 6 9J ... 
Sponsored by Americfan Sociological Association, Medical 
Sociology, Section, Committee on Disability, and 
Institute Psicologico de Puerto Rico, 1969. 

.Discussion centers on the role of social agencies 
in labelling disabled clients. f 

Haber, Lawrence. D. "Identifying the disabled : concepts and* 
methods in the measurement of disability." Social 
Security Bulletin '30:17-34 -(Dec), 1S67. 

Haber, Lawrence D. "Social planning for disability." 
Journal of Human Resources 8:33-55 (Supp. , Feb.), 1973. 

Hershenson, David B. "A three-dimensional model ior .client 
evaluation." Rehabilitation Counseling Bulletin 20:308- 
10 (June) , 1977. ' ^ 

Koshel, Jeffrey J. and' Carl V. .Granger. "Rehabilitation 
terminology: who is severely disabled?" Rehabilitation 
• Literature 39:102-06 (April), 1978. 

Nagi, Saad Z. Natural History of_ Disability . Columbus: 
Ohio State University, 1971. 

Nagi> Saad Z. Disability and Rehabilitation — Legal, 
Clinical,, and Self-Concept and Measurement . Columbus, 
Ohi o„S.t at e.JJn i_v.e r.s.i.t y Pr ess. 1969. 

Nagi, Saad Z. Disability In the United States: A Plan for 
an Informational System . Washington: Office for 

- Handicapped Individuals, 1977. 

Nagi, Saad Z. "Epidemiology of disability among adults in 
• the United States." Milbank Memorial Fund Quarterly 
54.:4-39=67-^^(-Fa-l-l-)-, i97-6.— • — 

Nagi, Saad Z. "Gate-keeping decisions in service 
organizations: when validity fails".." Human Organization 
33:4-7-58, 1974. 
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Nagi, Saad Z. "A study in the evaluation of disability and 
rehabilitation potential: concepts, methods, and 
procedures." American Journal of Public Health 54:1568- 
79, 1964. 

Neisworth, John T. and Robert T. Smith. "An analysis and 
-redefinition of . 'developmental disabilities.'" 
Exceptional Children 40:345-47 (Feb.), 1974-. 

Noble, John H. "Social inequity in he' prevalence of 
disability: projections for the year 20.00." In 
Population and Development Problems Relating to 
DisabilrEy -Prevention ,and. JReha.bi-l.i,ta t i on , pp. 23-32. 
Geneva: World Health Organization, 1980 

Rehab, Inc.- An Analytical Appraisal of the Definition of 
Severe Handicap . 2v. Arlington, VA, 1976. 

Salbreux, R., J. M. Deniaud, S. Tomkiewicz and M. Manciaux. 
"Classification and- prevalence of severe and multiple 



.handicaps." Neuropsychiat rie de 
1 ' Adolescence 27,1-2:5-28, 1979. " 



' Enf ance 



et 



de 



Seethamma, Hampapur N. and Ranjit K. Ma^umder.^ New 
rehabilitation law requires a clearer definition of 
handicap." Rehabilitation Literature 40:340-41f (Nov..- 
Dec), 1979. 

Sunshine, Jonathan. Disability (0MB Staff Technical Paper). 
Washington: U.S. Office of Man.agement and Budget, 1979. 

Describes definitions used by, and scope of, 
U.S. disability programs. 

U.S. Working Group on .Uniform Definition of Disability. 
'"Proposals to reduce the inconsistencies in concepts, 
criteria, and definitions of disability and hahdicap-- 
decision memorandum '[and attachments]." Washington: 
U.S. Dept. of Health, Education,. and Welfare. 
•-Assistant Secretary for Planning and Evaluation, 1978. 

- -Decision memorandum, d rafted by Henry Aaron, 

presents findings and recommendations of the Working 
Group. Attachments include John DeWilde' s - "Discussion 
paper': disability definitions," Geoffrey E. Bock's "A 
compilation of definitions pertaining to the handicapped 
and the disabled in federal statutes and regulations," 
and drafts and summary of several agency commentaries on 
*^i]lf proposal s of the Working Gfoupj. 

U.S. National Institute of Handicapped Research. 
Definitions Task Group. "Definitions for NIHR, general 
and specific: government usage." (Draft). Washington, 
n .d. 
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Limitation and Disability— Implications for Practice; 
Research, Program arid Policy Development and Service 
Delivery , Report of the First Mary £• Switze.r Memorial 
Seminar (Cleveland, 1975) . Washington: National 
Rehabi litatibn Assoc iat ion , n ♦"d. 

Wood^ Philip H. N. "Appreciating the consequences 
-disease: the international classification 
impairments, disabilities and handicaps," WHO Chronic 
34, 10:376-80 (Oct.), 1980. 

Wood, Philip 'H. ^N. and Elizabeth *M. Badley. "Setting, 
disablement in perspective. " International 

Rehabilitation Medicine 1, 1:32-37, 1978. 

Wood, Philip H. N. and Elizabeth* M. Badley. People with 
. Disabilities--Toward Acquiring Information Which 
Reflects More Sensitively their Problems and Needs 
(Monograph, i^o. 12) . New York: World- Rehabilitation 
Fund, 1981. 

World -Health Organization. International Classification of 
Diseases . 2v. Geneva, 1967. 

World Health Organiz^at ion . International Classification of 
Impairments , Disa'bilities , and Handicaos: ^ A Manual of 
Classi f ication * Relating to t he Consequences of Disease. 
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Manual "published^ for trial purposes in accordance 
with resolution WHA 29.35 of the Twenty-Ninth World 
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Ableson, Janet. "Measurement of disability in the Canada 
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Attkisson, C. C, W. A. Hargreaves, and W. A. Horowitz eds. 
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